16.20 - Areas of Unicode - Teradata Vantage NewSQL Engine

Teradata Vantage™ NewSQL Engine International Character Set Support

prodname: Teradata Database; Teradata Vantage NewSQL Engine
vrm_release: 16.20
created_date: March 2019
category: Configuration; User Guide
featnum: B035-1125-162K

It is often useful to divide the UNICODE character set into eight areas.

| Area | Description |
|---|---|
| General scripts | Latin, Greek, Cyrillic, Hebrew, Arabic, Indic, and other characters. |
| Symbols | Arrows, mathematical symbols, and punctuation. |
| CJK Phonetics and Symbols | Hiragana, Katakana, and Bopomofo. CJK stands for Chinese, Japanese, and Korean. |
| CJK Ideographs | Chinese, Japanese, and Korean ideographs. |
| Hangul Syllables | Complete set of modern Hangul. |
| Surrogates | Code points designed to extend the range of Unicode within the ISO 10646 encoding scheme. |
| Private Use Area | The Private Use Area contains characters for sharing site-defined characters from the KanjiEBCDIC, KanjiEUC, and KanjiShift-JIS client character sets. |
| Compatibility Zone | The Compatibility Zone contains halfwidth and fullwidth variants of characters defined by Japanese standards and, among others, includes Hankaku (halfwidth) Katakana and fullwidth ASCII characters. |
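
As a rough illustration of the eight areas, the following Python sketch classifies a code point by area. The boundaries are an assumption: they follow the early Unicode area layout for the Basic Multilingual Plane and are not taken from Teradata documentation, which gives no ranges here. The names `AREAS` and `unicode_area` are hypothetical.

```python
# Approximate area boundaries (assumed, not from Teradata documentation).
# Code points between areas (for example U+3400 to U+4DFF) are left unclassified.
AREAS = [
    (0x0000, 0x1FFF, "General scripts"),
    (0x2000, 0x2FFF, "Symbols"),
    (0x3000, 0x33FF, "CJK Phonetics and Symbols"),
    (0x4E00, 0x9FFF, "CJK Ideographs"),
    (0xAC00, 0xD7A3, "Hangul Syllables"),
    (0xD800, 0xDFFF, "Surrogates"),
    (0xE000, 0xF8FF, "Private Use Area"),
    (0xF900, 0xFFFF, "Compatibility Zone"),
]


def unicode_area(code_point):
    """Return the area name for a code point, or None if it falls outside the table."""
    for first, last, name in AREAS:
        if first <= code_point <= last:
            return name
    return None


if __name__ == "__main__":
    # Hiragana A, a CJK ideograph, and halfwidth Katakana A.
    for cp in (0x3042, 0x6F22, 0xFF71):
        print(f"U+{cp:04X}: {unicode_area(cp)}")
```

A lookup table keeps the sketch easy to adjust if a site prefers different boundaries.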


The first 1880 characters of the Private Use Area are used for sharing site-defined characters from KanjiEBCDIC, KanjiEUC, and KanjiShift-JIS client character sets.
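
The arithmetic behind that statement can be sketched in Python. This assumes only that the Private Use Area starts at U+E000, as it does in the Unicode standard; the function name `is_shared_site_defined` is hypothetical.

```python
PUA_START = 0xE000          # first code point of the Private Use Area
SHARED_COUNT = 1880         # characters reserved for site-defined characters

# Last code point of the shared region: U+E000 + 1880 - 1
SHARED_LAST = PUA_START + SHARED_COUNT - 1


def is_shared_site_defined(code_point):
    """True if the code point lies in the first 1880 characters of the Private Use Area."""
    return PUA_START <= code_point <= SHARED_LAST


if __name__ == "__main__":
    print(f"Shared region: U+{PUA_START:04X} to U+{SHARED_LAST:04X}")
```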

The following table defines the Teradata Database use of the Private Use Area.

| Name | Unicode Range | KanjiShift-JIS Range | KanjiEUC Range | KanjiEBCDIC Range | Comments |
|---|---|---|---|---|---|
| Gaiji-1 | U+E000 to U+E3AB | 0xF040 to 0xF4FC (1st 940 Gaiji) | 0xF5A1 to 0xFEFE (Rows 85-94 JIS X 0208) | 0x6941 to 0x6DF4 (1st 940 Gaiji) | Shared by all. (940 characters) |
| Gaiji-2 | U+E3AC to U+E757 | 0xF540 to 0xF9FC (2nd 940 Gaiji) | 0x8FF5A1 to 0x8FFEFE (Rows 85-94 JIS X 0212) | 0x6DF5 to 0x72EA (2nd 940 Gaiji) | Shared by all. (940 characters) |
| Graphic Error Character | U+F8FF | Not applicable | Not applicable | Not applicable | Associated with the VARGRAPHIC function |
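
The ranges in the table are consistent with a simple sequential mapping from each client encoding to the Private Use Area. The Python sketch below illustrates that idea. It assumes that characters are assigned in ascending code order within each range, that valid KanjiShift-JIS trail bytes are 0x40-0x7E and 0x80-0xFC (188 per lead byte), that KanjiEUC rows hold 94 characters (0xA1-0xFE), and that KanjiEBCDIC trail bytes are 0x41-0xFE (190 per lead byte). These assumptions reproduce the endpoints shown in the table, but the source does not state the per-character mapping, and all function names are hypothetical.

```python
GAIJI_BASE = 0xE000    # start of the Private Use Area (Gaiji-1)
GAIJI_BLOCK = 940      # characters in each Gaiji block
GAIJI_TOTAL = 2 * GAIJI_BLOCK


def sjis_gaiji_to_unicode(code):
    """Map a KanjiShift-JIS Gaiji code (0xF040-0xF9FC) to its assumed code point."""
    lead, trail = code >> 8, code & 0xFF
    if not 0xF0 <= lead <= 0xF9:
        raise ValueError(f"lead byte out of Gaiji range: {code:#x}")
    if 0x40 <= trail <= 0x7E:
        trail_index = trail - 0x40
    elif 0x80 <= trail <= 0xFC:
        trail_index = trail - 0x41      # 0x7F is not a valid trail byte
    else:
        raise ValueError(f"invalid trail byte: {code:#x}")
    return GAIJI_BASE + (lead - 0xF0) * 188 + trail_index


def euc_gaiji_to_unicode(code):
    """Map a KanjiEUC Gaiji code (0xF5A1-0xFEFE or 0x8FF5A1-0x8FFEFE)."""
    offset = 0
    if code > 0xFFFF:
        if code >> 16 != 0x8F:
            raise ValueError(f"unexpected three-byte prefix: {code:#x}")
        offset = GAIJI_BLOCK            # the 0x8F prefix selects Gaiji-2
        code &= 0xFFFF
    lead, trail = code >> 8, code & 0xFF
    if not (0xF5 <= lead <= 0xFE and 0xA1 <= trail <= 0xFE):
        raise ValueError(f"code out of Gaiji range: {code:#x}")
    return GAIJI_BASE + offset + (lead - 0xF5) * 94 + (trail - 0xA1)


def ebcdic_gaiji_to_unicode(code):
    """Map a KanjiEBCDIC Gaiji code (0x6941-0x72EA)."""
    lead, trail = code >> 8, code & 0xFF
    if not 0x41 <= trail <= 0xFE:
        raise ValueError(f"invalid trail byte: {code:#x}")
    index = (lead - 0x69) * 190 + (trail - 0x41)
    if not 0 <= index < GAIJI_TOTAL:
        raise ValueError(f"code out of Gaiji range: {code:#x}")
    return GAIJI_BASE + index


if __name__ == "__main__":
    # Range endpoints taken from the table above.
    print(hex(sjis_gaiji_to_unicode(0xF4FC)))
    print(hex(euc_gaiji_to_unicode(0x8FF5A1)))
    print(hex(ebcdic_gaiji_to_unicode(0x72EA)))
```

Under these assumptions, each encoding's first and last Gaiji codes land on the Unicode range endpoints listed in the table, which is a useful sanity check before relying on the mapping for real data.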