UNICODE Server Character Set | Data Types and Literals | VantageCloud Lake - UNICODE Server Character Set - Teradata Vantage

Teradata® VantageCloud Lake

Deployment
VantageCloud
Edition
Lake
Product
Teradata Vantage
Published
January 2023
ft:locale
en-US
ft:lastEdition
2024-12-11
dita:mapPath
phg1621910019905.ditamap
dita:ditavalPath
pny1626732985837.ditaval
dita:id
phg1621910019905

Intended Use

International applications using the Unicode repertoire or a subset thereof.

Pad Character

SPACE (U+0020)

SQL Declaration

To specify the UNICODE server character set for a character column, use the following syntax.

Data Type Maximum Value for n
CHARACTER(n) CHARACTER SET UNICODE 32000

(This is also the size of LONG VARCHAR CHARACTER SET UNICODE.)

VARCHAR(n) CHARACTER SET UNICODE
CLOB(n) CHARACTER SET UNICODE 1048544000

Usage Notes

The UNICODE server character set supports the 16-bit BMP characters from Unicode® 6.0. For a list of the supported characters, see Supported UNICODE Client Character Sets .

Unicode also allows the storage and searching of all other 16-bit and 32-bit Unicode characters. For more information, see UNICODE PASS THROUGH.

Each code point represents a distinct character, including non-spacing characters such as diacritical marks and joiners. All characters named as letters in UNICODE are considered as such, and are candidates for uppercasing.

Database storage space for UNICODE is allocated on a character basis. 32-bit Pass Through Characters (for example, emoji) require two 16-bit UTF-16 code units, or 4 bytes. Therefore, VARCHAR(2) or CHAR(2) are the minimum sizes required to store a Pass Through Character.

See UNICODE Server Character Set .