Character Sets - Teradata Vantage NewSQL Engine - 16.20

Teradata Vantageā„¢ XML Data Type

prodname
Teradata Database
Teradata Vantage NewSQL Engine
vrm_release
16.20
category
Programming Reference
featnum
B035-1140-162K

XML documents declare their encoding in the XML declaration. The XML type implementation parses and stores XML in the database with the character data encoded in UTF-8 encoding. XML documents transferred from the client to the server using the text format are expected to be encoded in UTF-8. The encoding specified in the XML declaration is ignored by the XML type implementation on the server side in this case. Similarly, XML type values transferred from server to client in the text format are encoded in UTF-8.

Xerces supports the following encodings out of the box.
  • ASCII
  • UTF-8
  • UTF-16 (big/little endian)
  • UCS4 (big/little endian)
  • EBCDIC code pages IBM037, IBM1047 and IBM1140
  • ISO-8859-1 (Latin1)
  • Windows-1252
In the base case, documents can always be loaded and returned as the following:
  • XML type because UTF-8 is supported out of the box by Xerces
  • VARCHAR/CLOB because the transcoding to UTF-8/16 is handled by the DBS

Documents loaded as BLOBs can only be in the supported encodings.