International applications using the Unicode repertoire or a subset thereof.
To specify the UNICODE server character set for a character column, use the following syntax.
|Data Type||Maximum Value for n|
|CHARACTER(n) CHARACTER SET UNICODE||32000
(This is also the size of LONG VARCHAR CHARACTER SET UNICODE.)
|VARCHAR(n) CHARACTER SET UNICODE|
|CLOB(n) CHARACTER SET UNICODE||1048544000|
The UNICODE server character set supports the 16-bit BMP characters from Unicode® 6.0. For a list of the supported characters, see Character Set Files in Teradata Vantage™ NewSQL Engine International Character Set Support, B035-1125.
Unicode also allows the storage and searching of all other 16-bit and 32-bit Unicode characters. For more information, see Unicode Pass Through in Teradata Vantage™ NewSQL Engine International Character Set Support, B035-1125.
Each code point represents a distinct character, including non-spacing characters such as diacritical marks and joiners. All characters named as letters in UNICODE are considered as such, and are candidates for uppercasing.
Database storage space for UNICODE is allocated on a character basis. 32-bit Pass Through Characters (for example., emoji) require two 16-bit UTF-16 code units, or 4 bytes. Therefore, VARCHAR(2) or CHAR(2) are the minimum sizes required to store a Pass Through Character.
For details about the UNICODE server character set, see Teradata Vantage™ NewSQL Engine International Character Set Support, B035-1125.