UNICODE Server Character Set

Teradata Vantage™ Data Types and Literals

Product: Teradata Database, Teradata Vantage NewSQL Engine
Release: 16.20
Category: Programming Reference
Document number: B035-1143-162K

Intended Use

International applications using the Unicode repertoire or a subset thereof.

Pad Character

SPACE (U+0020)

SQL Declaration

To specify the UNICODE server character set for a character column, use the following syntax.

Data Type                            Maximum Value for n
CHARACTER(n) CHARACTER SET UNICODE   32000
VARCHAR(n) CHARACTER SET UNICODE     32000 (This is also the size of LONG VARCHAR CHARACTER SET UNICODE.)
CLOB(n) CHARACTER SET UNICODE        1048544000

Usage Notes

The UNICODE server character set supports the 16-bit BMP characters from Unicode® 6.0. For a list of the supported characters, see Character Set Files in Teradata Vantage™ NewSQL Engine International Character Set Support, B035-1125.

The UNICODE server character set also allows the storage and searching of all other 16-bit and 32-bit Unicode characters, that is, characters that occupy one or two 16-bit UTF-16 code units. For more information, see Unicode Pass Through in Teradata Vantage™ NewSQL Engine International Character Set Support, B035-1125.

Each code point represents a distinct character, including non-spacing characters such as diacritical marks and joiners. All characters named as letters in UNICODE are considered as such, and are candidates for uppercasing.
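The code point rule above can be illustrated outside the database. The following is a minimal Python sketch, not Teradata code: the helper name describe is hypothetical, and Python's str.upper() is used only as a stand-in for uppercasing, on the assumption that it behaves comparably for this simple case. It shows that a letter followed by a non-spacing diacritical mark is two code points, while the precomposed form is one.

```python
import unicodedata


def describe(text):
    """Return (code point, Unicode name, general category) for each code point.

    Hypothetical helper for illustration only.
    """
    return [
        (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))
        for ch in text
    ]


composed = "\u00e9"     # LATIN SMALL LETTER E WITH ACUTE: one code point
decomposed = "e\u0301"  # 'e' followed by COMBINING ACUTE ACCENT: two code points

for label, text in (("composed", composed), ("decomposed", decomposed)):
    # len() on a Python str counts code points, not rendered glyphs.
    print(label, len(text), describe(text))

# Uppercasing changes the letter; the non-spacing mark (category Mn) is unchanged.
uppercased = decomposed.upper()
print(describe(uppercased))
```

Both strings render as the same glyph, yet they differ in length when counted by code point, which is the counting basis the paragraph above describes.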

Database storage space for UNICODE is allocated on a character basis. 32-bit Pass Through Characters (for example, emoji) require two 16-bit UTF-16 code units, or 4 bytes. Therefore, VARCHAR(2) or CHAR(2) is the minimum size required to store a Pass Through Character.
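The sizing arithmetic above can be checked with a short Python sketch. This is an illustration under stated assumptions, not Teradata behavior: it assumes that the declared length n counts 16-bit UTF-16 code units, as the paragraph implies, and the function names utf16_code_units, utf16_bytes, and fits_declared_length are hypothetical.

```python
def utf16_code_units(text):
    """Number of 16-bit UTF-16 code units needed to encode text."""
    # utf-16-le emits exactly 2 bytes per code unit and no byte order mark.
    return len(text.encode("utf-16-le")) // 2


def utf16_bytes(text):
    """Number of bytes text occupies when encoded as UTF-16."""
    return len(text.encode("utf-16-le"))


def fits_declared_length(text, n):
    """Assumption: a CHAR(n) or VARCHAR(n) UNICODE column holds n code units."""
    return utf16_code_units(text) <= n


samples = {
    "BMP letter": "A",
    "BMP accented letter": "\u00e9",
    "emoji (outside the BMP)": "\U0001F600",
}

for name, value in samples.items():
    print(name, utf16_code_units(value), "code units,", utf16_bytes(value), "bytes")
```

Under that assumption, the emoji needs two code units (4 bytes), so fits_declared_length("\U0001F600", 1) is false and fits_declared_length("\U0001F600", 2) is true, matching the VARCHAR(2) or CHAR(2) minimum stated above.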

For details about the UNICODE server character set, see Teradata Vantage™ NewSQL Engine International Character Set Support, B035-1125.