16.20 - Usage Notes - Teradata Tools and Utilities

Teradata® Call-Level Interface Version 2 Reference for Mainframe-Attached Systems

prodname
Teradata Tools and Utilities
vrm_release
16.20
created_date
September 2019
category
Programming Reference
featnum
B035-2417-108K

NAME identifies the character set to which the description applies. The name may include a standard suffix that defines the encoding scheme. The standard suffix consists of an underscore, a number not relevant to CLIv2, the encoding character (A, E, I, R, S, T, or U), and an optional character not relevant to CLIv2. Each suffix corresponds to an ENCODING operand value:

  • E - EDBDIC
  • I - IBMSOSI
  • A - ASCII
  • R - BIGFIVE
  • S - SJIS
  • T - EUC-CN or EUC-KR
  • U - EUC-JP

ENCODING optionally identifies the encoding scheme for the character set. If omitted, the character set must contain a standard suffix that indicates the encoding. If such a suffix exists, then the encoding cannot be overridden using this operand. The following character sets are available in CLIv2.

ENCODING Meaning Characteristics
EBCDIC Extended Binary-Coded-Decimal Interchange Code
  • Single-byte (EBCDIC) codepoints:

    X'00' through X'FF'

IBMOSI IBM Shift-out/Shift-in
  • Single-byte (EBCDIC) codepoints:

    X'00' through X'FF'

  • Double-byte (EBDCIC) codepoints:

    Shift-out (X'0E') through Shift-in (X'0E')

ASCII American Standard Code for Information Interchange
  • Single-byte (ASCII) codepoints:

    X'00' through X'FF'

BIGFIVE Big Five Plus
  • Single-byte (ASCII) codepoints:

    X'00' through X'80', and X'FF'

  • Double-byte (ASCII) codepoints:

    X'81' through X'FE'

EUC-CN Extended Unix Code - China
  • Single-byte (ASCII) codepoints:

    X'00' through X'7F'

  • Double-byte (ASCII) codepoints:

    X'80' through X'FF'

EUC-JP Extended Unix Code - Japan
  • Single-byte (ASCII) codepoints:

    X'00' through X'8D'

    X'90' through X'FF'

  • Double-byte (ASCII) codepoints:

    Single-shift1 (X'8E')

  • Triple-byte (ASCII) codepoints:

    Single-shift2 (X'8F)'

EUC-KR Extended Unix Code - Korea
  • Single-byte (ASCII) codepoints:

    X'00' through X'7F'

  • Double-byte (ASCII) codepoints:

    X'80' through X'FF'

SJIS Shift-JIS (Japanese Industrial Standard)
  • Single-byte (ASCII) codepoints:

    X'00' through X'80'

    X'A0' through X'DF'

    X'FD' through X'FF'

  • Double-byte (ASCII) codepoints:

    X'81' through X'9F'

    X'E0' through X'FC'

UHC Unified Hangul Code
  • Single-byte (ASCII) codepoints:

    X'00' through X'80', and X'FF'

  • Double-byte (ASCII) codepoints:

    X'81' through X'FE'

UTF8 UCS (Universal Character Set) Transformation Format 8-bit
  • Single-byte (Unicode) codepoints:

    X'00' through X'7F'

  • Double-byte (Unicode) codepoints:

    X'C0' through X'DF'

  • (Most) triple-byte (Unicode) codepoints:

    X'E0' through X'FE'

Most four-byte codepoints (X'F0' through X'F4') are not supported by Teradata Database.

UTF16 UCS (Universal Character Set) Transformation Format- 16-bit
  • Single-byte (Unicode) codepoints:

    X'0000' through X'D7FF'

    X'E000' through X'FFFF'

Surrogates (four-byte codepoints that begin or end with the two-byte codepoints X'D800' through X'DBFF') are not supported by Teradata Database.

While all codepoints are reflected to and from Teradata Database, for character sets that allow mixtures of single and multi-byte characters, only the single-byte characters are meaningful to CLIv2.

Example

Begin definition for IBM Code Page 833, the single-byte component for IBM CCSID 933.

CHARSET NAME KOREAN_EBCDIC933 ENCODING IBMSOSI