Intended Use
International applications using the Unicode repertoire or a subset thereof.
Pad Character
SPACE (U+0020)
SQL Declaration
To specify the UNICODE server character set for a character column, use the following syntax.
Data Type | Maximum Value for n |
---|---|
CHARACTER(n) CHARACTER SET UNICODE | 32000 (This is also the size of LONG VARCHAR CHARACTER SET UNICODE.) |
VARCHAR(n) CHARACTER SET UNICODE | |
CLOB(n) CHARACTER SET UNICODE | 1048544000 |
Usage Notes
The UNICODE server character set supports the 16-bit BMP characters from Unicode® 6.0. For a list of the supported characters, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.
Unicode also allows the storage and searching of all other 16-bit and 32-bit Unicode characters. For more information, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.
Each code point represents a distinct character, including non-spacing characters such as diacritical marks and joiners. All characters named as letters in UNICODE are considered as such, and are candidates for uppercasing.
Database storage space for UNICODE is allocated on a character basis. 32-bit Pass Through Characters (for example, emoji) require two 16-bit UTF-16 code units, or 4 bytes. Therefore, VARCHAR(2) or CHAR(2) are the minimum sizes required to store a Pass Through Character.
For details about the UNICODE server character set, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.