15.10 - Error Characters - Teradata Database

Teradata Database International Character Set Support

prodname
Teradata Database
vrm_release
15.00
15.10
category
Configuration
User Guide
featnum
B035-1125-015K

Each client character set and each server character set includes an error character.

The following table lists the error characters associated with each client character set.

 

Client Character Set

Error Character

  • ASCII character set
  • KanjiSJIS_0S character set
  • KanjiEUC_0U character set
  • 0x1A

  • EBCDIC character set
  • KanjiEBCDIC character sets
  • 0x3F

  • KanjiSJIS_0S character set, GRAPHIC data (Returned only when exporting in Record or Indicator modes.)
  • KanjiEUC_0U character set, GRAPHIC data (Record and Field modes only.)
  • KanjiEBCDIC character sets, GRAPHIC data (Record mode only.)
  • 0xFFFD

  • Site-defined single-byte client character sets
  • TCHBIG5_1R0
  • SCHGB2312_1T0
  • HANGULKSC5601_2R4
  • SDSCHGB2312_2T0
  • SDTCHBIG5_3R0
  • SDHANGULKSC5601_4R4
  • KANJI932_1S0
  • SCHINESE936_6R0
  • TCHINESE950_8R0
  • HANGUL949_7R0
  • THAI874_4A0
  • LATIN1250_1A0
  • CYRILLIC1251_2A0
  • HEBREW1255_5A0
  • ARABIC1256_6A0
  • LATIN1254_7A0
  • LATIN1258_8A0
  • LATIN1252_3A0
  • DBC.Translation.TranslateOut of 0x1A

  • SCHEBCDIC935_2IJ
  • TCHEBCDIC937_3IB
  • HANGULEBCDIC933_1II
  • SDSCHEBCDIC935_6IJ
  • SDTCHEBCDIC937_7IB
  • SDHANGULEBCDIC933_5II
  • DBC.Translation.TranslateOut of 0x3F

    The following table lists the error characters for the server character sets.

     

    Form-of-Use

    Error Character

    LATIN server character set

    0x1A

    KANJI1 server character set

    0x1A

    KANJISJIS server character set

    0x1A

    UNICODE server character set

    U+FFFD

    GRAPHIC server character set

    U+FFFD

    GRAPHIC server character set, VARGRAPHIC function

    U+F8FF

    Note: 0x1A in LATIN/KANJI1/KANJISJIS and U+FFFD in UNICODE/GRAPHIC are used internally by Teradata as the error character; therefore, they are unusable as user data. The user cannot store or retrieve these values through Teradata.

    For other site-defined character sets, the DBC.Translation table defines the translation of single-byte character sets and the single-byte character portion of Kanji character sets. The error character for these character sets is derived from how they translate to the LATIN and KANJI1 error character 0x1A.

    This translation is defined in the column TranslateOut of the DBC.Translation table.

    You can determine this character using the following SELECT statement.

       SELECT SUBSTR(TRANSLATEOUT,27,1)
       FROM DBC.TRANSLATION
       WHERE TRANSLATENAME = character_set;

    Internal conversion between the server character sets preserves the error character.

    For example, when translating from LATIN to UNICODE to KANJISJIS, 0x1A translates to U+FFFD which translates to 0x1A.

    For the KANJI1 character set, conversion of multibyte characters never causes an error.

    Suppose that a column declared as CHAR(2) CHARACTER SET UNICODE stores CAPITAL LETTER A WITH MACRON ( Α ).

    The Teradata Database representation is as follows.

     

    01

    00

    00

    20

    A

     

    Selection from a KanjiEUC session successfully returns Α encoded in code set 3 (cs3) of EUC. The KanjiEUC representation is as follows.

     

    8F

    AA

    A7

    20

    A

     

    Assume that an ASCII session does a SELECT on the column containing the Α. The export width for the session is set to the expected defaults and the conversion options are set to ignore conversion.

    Because the character cannot be represented in ASCII, it is converted to the ASCII error character 0x1A. The representation of the results in ASCII is as follows.

     

    1A

    20

    not valid

     

    Selection from KanjiSJIS returns an error character.

     

    1A

    20

    20

    20

    not valid

     

     

     

    Selection from KanjiEBCDIC also returns an error character.

     

    3F

    40

    40

    40

    40

    40

    not valid