Character Sets - Analytics Database - Teradata Vantage

XML Data Type

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
VMware
Product
Analytics Database
Teradata Vantage
Release Number
17.20
Published
June 2022
Language
English (United States)
Last Update
2023-10-30
dita:mapPath
tkc1628112506748.ditamap
dita:ditavalPath
qkf1628213546010.ditaval
dita:id
dgs1472251600184
lifecycle
latest
Product Category
Teradata Vantageā„¢

XML documents declare their encoding in the XML declaration. The XML type implementation parses and stores XML in the database with the character data encoded in UTF-8 encoding. XML documents transferred from the client to the server using the text format are expected to be encoded in UTF-8. The encoding specified in the XML declaration is ignored by the XML type implementation on the server side. Similarly, XML type values transferred from server to client in the text format are encoded in UTF-8.

Xerces supports the following encodings out of the box.
  • ASCII
  • UTF-8
  • UTF-16 (big/little endian)
  • UCS4 (big/little endian)
  • EBCDIC code pages IBM037, IBM1047 and IBM1140
  • ISO-8859-1 (Latin1)
  • Windows-1252
In the base case, documents can always be loaded and returned as the following:
  • XML type because UTF-8 is supported out of the box by Xerces
  • VARCHAR/CLOB because the transcoding to UTF-8/16 is handled by the DBS

Documents loaded as BLOBs can only be in the supported encodings.