17.00 - Client Character Sets - Parallel Data Pump

Teradata® Parallel Data Pump Reference

prodname
Parallel Data Pump
vrm_release
17.00
created_date
June 2020
category
Programming Reference
featnum
B035-3021-220K

Standard Character Sets

The following table lists the standard character sets which are supported by the database.

Standard Character Sets 
Standard Character Sets
System Configuration Name
Mainframe-attached EBCDIC
Workstation-attached ASCII

The terms ASCII and EBCDIC are often used in ambiguous ways, and this presents a difficulty for accented and non-Latin characters. The user should select a client character set that exactly matches the character set that the import data uses.

If accented and non-Latin characters are used, do not use the ASCII or EBCDIC client character sets. Instead, load and use one of the other Teradata-supplied character sets, or a site-defined character set that exactly matches the application character set, such as: EBCDIC037_0E for mainframe-attached clients (for the United States or Canada), LATIN1_0A, LATIN9_0A (for Western European languages), LATIN1252_0A for Western European Microsoft® Windows clients, or UTF-8 for UNIX system-based clients.

Japanese Characters Sets

The following table lists the Japanese character sets which are supported by the database.

Japanese Character Sets 
Japanese Character Sets
System Configuration Character Set Name
Mainframe-attached KATAKANAEBCDIC

KANJIEBCDIC5026_0I

KANJIEBCDIC5035_0I

Workstation-attached KANJIEUC_0U

KANJISJIS_0S

For more information on kanji character sets, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.

Teradata TPump statements do not accept object names specified in internal database hexadecimal form and do not display object names in hexadecimal form.

Chinese and Korean Character Sets

Chinese and Korean character sets are available for mainframe- and workstation-attached client systems.

Chinese Character Sets 
Chinese Character Sets
System Configuration Name
Mainframe-attached SCHEBCDIC935_2IJ

TCHEBCDIC937_3IB

Workstation-attached SCHGB2312_1T0

TCHBIG5_1R0

Korean Character Sets 
Korean Character Sets
System Configuration Name
Mainframe-attached HANGULEBCDIC933_1II
Workstation-attached HANGULKSC5601_2R4

Rules for Using Chinese and Korean Character Sets

Certain rules apply when using Chinese and Korean character sets on mainframe- and workstation-attached platforms.
  • Object Names

    The database supports multi-byte characters in object names when the client session character set is UTF-8 or UTF-16. For a list of valid and non-valid characters when multi-byte object names are used, see Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125.

    If multi-byte characters are used in object names in Teradata TPump script, they must be enclosed in double quotes.

  • Maximum String Length

    The database requires two bytes to process each of the Chinese or Korean characters. This limits both request size and record size. For example, if a record consists of one string, the length of that string is limited to a maximum of 32,000 characters or 64,000 bytes.