Standard Character Sets
The following table lists the standard character sets which are supported by the database.
Standard Character Sets | |
---|---|
System Configuration | Name |
Mainframe-attached | EBCDIC |
Workstation-attached | ASCII |
The terms ASCII and EBCDIC are often used in ambiguous ways, and this presents a difficulty for accented and non-Latin characters. The user should select a client character set that exactly matches the character set that the import data uses.
If accented and non-Latin characters are used, do not use the ASCII or EBCDIC client character sets. Instead, load and use one of the other Teradata-supplied character sets, or a site-defined character set that exactly matches the application character set, such as: EBCDIC037_0E for mainframe-attached clients (for the United States or Canada), LATIN1_0A, LATIN9_0A (for Western European languages), LATIN1252_0A for Western European Microsoft® Windows clients, or UTF-8 for UNIX system-based clients.
Japanese Characters Sets
The following table lists the Japanese character sets which are supported by the database.
Japanese Character Sets | |
---|---|
System Configuration | Character Set Name |
Mainframe-attached | KATAKANAEBCDIC KANJIEBCDIC5026_0I KANJIEBCDIC5035_0I |
Workstation-attached | KANJIEUC_0U KANJISJIS_0S |
For more information on kanji character sets, see International Character Set Support, B035-1125.
Chinese and Korean Character Sets
Chinese and Korean character sets are available for mainframe- and workstation-attached client systems.
Chinese Character Sets | |
---|---|
System Configuration | Name |
Mainframe-attached | SCHEBCDIC935_2IJ TCHEBCDIC937_3IB |
Workstation-attached | SCHGB2312_1T0 TCHBIG5_1R0 |
Korean Character Sets | |
---|---|
System Configuration | Name |
Mainframe-attached | HANGULEBCDIC933_1II |
Workstation-attached | HANGULKSC5601_2R4 |
Rules for Using Chinese and Korean Character Sets
- Object Names
The database supports multi-byte characters in object names when the client session character set is UTF-8 or UTF-16. For a list of valid and non-valid characters when multi-byte object names are used, see International Character Set Support, B035-1125.
If multi-byte characters are used in object names in Teradata TPump script, they must be enclosed in double quotes.
- Maximum String Length
The database requires two bytes to process each of the Chinese or Korean characters. This limits both request size and record size. For example, if a record consists of one string, the length of that string is limited to a maximum of 32,000 characters or 64,000 bytes.