LATIN Server Character Set - Teradata Database

International Character Set Support

Product
Teradata Database
Release Number
15.10
Language
English (United States)
Last Update
2018-09-25
dita:id
B035-1125
lifecycle
previous
Product Category
Teradata® Database

The Teradata LATIN server character set is an extension of the R2 international character set. This extension is designed to provide support for the ISO 8859-1 (Latin1) and ISO 8859-15 (Latin9) repertoires.

This internal Teradata Database form-of-use is also the client form-of-use when the client character set is ASCII (although seven-bit ASCII is not affected).

Clients accessing the LATIN server character set can use any of the following single-byte client character sets:

  • ASCII
  • LATIN1_0A
  • LATIN9_0A
  • LATIN1252_0A
  • EBCDIC
  • EBCDIC037_0E
  • EBCDIC277_0E
  • EBCDIC273_0E
  • Simple custom (site-defined) client character sets based ASCII or EBCDIC
  • For details on LATIN-compatible client character sets, see Chapter 2: “Client Character Set Options.”

    When used in the Teradata Database, ASCII and Teradata LATIN are identical on all code points except the 80-FF range, where Teradata LATIN defines additional West European letters.

    Code points outside the seven-bit ASCII range result in data that may not behave as intended.

    You can detect code points that conflict with ASCII semantics either before or after migrating to Teradata LATIN. Corrective action can then be taken under either environment.

       SELECT cchar 
       FROM tab1
       WHERE 
       INDEX(cchar, '80'XC) <> 0
       OR INDEX(cchar, '81'XC) <> 0
       OR INDEX(cchar, '82'XC) <> 0
       .
       /* all codes 83-FE */
       .
       OR INDEX(cchar, 'FF'XC) <> 0;

    If you intend to use characters with code points above ASCII X’7F’, take the following steps:

    1 Install, as appropriate, LATIN1_0A, LATIN9_0A, EBCDIC037_0E, or a site-defined character set that matches the character set used on your client.

    2 Set the client character set to LATIN1_0A, LATIN9_0A, EBCDIC037_0E, or the name of the site-defined character set that you installed in Step 1 in order to use the encodings.

    The preceding applies if non-ASCII characters are desired.

    Teradata Database Japanese language facilities support ISO 8859-1 (Latin1) with the restriction that the code points 80-85 are not allowed.

    The characters at 80-85 are in the C1 control region and are not considered to be essential.

    This restriction is scheduled to be removed in a future release.

    Teradata Database Japanese language facilities support ISO 8859-15 (Latin9) with the restriction that the code points 80-85 and the LETTER Z WITH CARON are not allowed.

    The characters at 80-85 are in the C1 control region and are not considered to be essential. The character LETTER Z WITH CARON is part of the support for Finnish, but is not considered part of the Finnish alphabet and is rarely used even in Finland.

    These restrictions are scheduled to be removed in a future release.

    For supported LATIN characters, see LATIN Server Character Set, available on the documentation CD and on the Web at http://www.info.teradata.com/.

    For additional information on the LATIN server character set, see SQL Data Types and Literals.