The CHARSET directive explicitly begins a definition and possibly the encoding scheme.
Syntax
Usage Notes
- E – EBCDIC
- I – IBMSOSI
- A – ASCII
- R – BIGFIVE
- S – SJIS
- T – EUC-CN or EUC-KR
- U – EUC-JP
ENCODING optionally identifies the encoding scheme for the character set. If omitted, the character set must contain a standard suffix that indicates the encoding. If such a suffix exists, then the encoding cannot be overridden using this operand. The following character sets are available in TDP.
ENCODING | Meaning | Characteristics |
---|---|---|
EBCDIC | Extended Binary-Coded-Decimal Interchange Code |
|
IBMOSI | IBM Shift-out/Shift-in |
|
ASCII | American Standard Code for Information Interchange |
|
BIGFIVE | Big Five Plus |
|
EUC-CN | Extended Unix Code - China |
|
EUC-JP | Extended Unix Code - Japan |
|
EUC-KR | Extended Unix Code - Korea |
|
SJIS | Shift-JIS (Japanese Industrial Standard) |
|
UHC | Unified Hangul Code |
|
UTF8 | UCS (Universal Character Set) Transformation Format 8-bit |
Most four-byte codepoints (X'F0' through X'F4') are not supported by the database. |
UTF16 | UCS (Universal Character Set) Transformation Format- 16-bit |
Surrogates (four-byte codepoints that begin or end with the two-byte codpoints X'D800' through X'DBFF') are not supported by the database. |
When the NAME operand is specified, if this name does not match the character set name specified on the SET USERCS command, this directive and all directives until the next CHARSET directive are ignored. When the NAME operand is not specified, then this directive is used, which implies that any subsequent CHARSET directives in the file will never be processed since this one will always be used.
While all codepoints are reflected to and from the database, for character sets that allow mixtures of single and multi-byte characters, only the single-byte characters are meaningful in TDP command syntax.
Example: CHARSET
Begin definition for IBM Code Page 833, the single-byte component for IBM CCSID 933.
CHARSET NAME KOREAN_EBCDIC933 ENCODING IBMSOSI