UNICODE Server Character Set

Teradata Vantage™ - Data Types and Literals

Product: Advanced SQL Engine, Teradata Database
Release Number: 17.10
Release Date: July 2021
Content Type: Programming Reference
Publication ID: B035-1143-171K
Language: English (United States)

Intended Use

International applications using the Unicode repertoire or a subset thereof.

Pad Character

SPACE (U+0020)

SQL Declaration

To specify the UNICODE server character set for a character column, use the following syntax.

Data Type                             Maximum Value for n
CHARACTER(n) CHARACTER SET UNICODE    32000 (This is also the size of LONG VARCHAR CHARACTER SET UNICODE.)
VARCHAR(n) CHARACTER SET UNICODE      32000
CLOB(n) CHARACTER SET UNICODE         1048544000
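
As an illustration of the declarations above, the following sketch defines one column of each type. The table and column names are hypothetical and the lengths are arbitrary values within the documented maximums; adjust them to your own schema.

```sql
-- Hypothetical table; all names and lengths are illustrative only.
CREATE TABLE customer_intl (
    customer_id   INTEGER NOT NULL,
    -- Fixed-length column, padded with SPACE (U+0020) up to 2 characters.
    country_code  CHARACTER(2) CHARACTER SET UNICODE,
    -- Variable-length column; n may be at most 32000 for UNICODE.
    full_name     VARCHAR(200) CHARACTER SET UNICODE,
    -- Character large object; n may be at most 1048544000 for UNICODE.
    notes         CLOB(100000) CHARACTER SET UNICODE
)
PRIMARY INDEX (customer_id);
```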

Usage Notes

The UNICODE server character set supports the 16-bit BMP characters from Unicode® 6.0. For a list of the supported characters, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.

The UNICODE server character set also allows the storage and searching of all other 16-bit and 32-bit Unicode characters. For more information, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.

Each code point represents a distinct character, including non-spacing characters such as diacritical marks and joiners. All characters named as letters in UNICODE are treated as letters and are candidates for uppercasing.

Database storage space for UNICODE is allocated on a character basis. 32-bit Pass Through Characters (for example, emoji) require two 16-bit UTF-16 code units, or 4 bytes. Therefore, VARCHAR(2) or CHAR(2) is the minimum size required to store a single Pass Through Character.
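
The sizing rule can be sketched as follows. This is a minimal example under stated assumptions, not a definitive procedure: the table name is hypothetical, the session is assumed to use a UTF8 or UTF16 client character set, and Unicode Pass Through is assumed to be available and enabled with the SET SESSION statement shown. See B035-1125 for the exact requirements.

```sql
-- Assumption: Unicode Pass Through is supported on this system and the
-- session uses a UTF8 or UTF16 client character set.
SET SESSION CHARACTER SET UNICODE PASS THROUGH ON;

-- Hypothetical table; names are illustrative only.
CREATE TABLE reaction_log (
    reaction_id INTEGER NOT NULL,
    -- One emoji occupies two 16-bit UTF-16 code units (4 bytes),
    -- so n must be at least 2 to hold a single Pass Through Character.
    reaction    VARCHAR(2) CHARACTER SET UNICODE
)
PRIMARY INDEX (reaction_id);

-- A single emoji fills both code units of the VARCHAR(2) column.
INSERT INTO reaction_log (reaction_id, reaction) VALUES (1, '😀');
```

A column declared as VARCHAR(1) or CHAR(1) would be too small for this value, because n counts 16-bit code units rather than displayed characters.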

For details about the UNICODE server character set, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.