16.20 - KanjiEBCDIC Example - Teradata Vantage NewSQL Engine

Teradata Vantage™ NewSQL Engine International Character Set Support

prodname
Teradata Database
Teradata Vantage NewSQL Engine
vrm_release
16.20
created_date
March 2019
category
Configuration
User Guide
featnum
B035-1125-162K

For KanjiEBCDIC, the problem occurs when a single-byte character is placed between two sequences of multibyte characters.

For example, consider the following SELECT statement.

SELECT '平成1年';

平成1年, where the 1 is a single-byte character, is translated internally to UCS2.

5E 73 62 10 00 31 5E 74
1

Because this is four characters, the expected-defaults exports at most ten bytes, but the representation of 平成1年 in KanjiEBCDIC requires eleven bytes.

0E 45 8D 45 BA 0F 31 0E 45 60 0F
< > 1 < >

Therefore the output must be truncated to ten bytes.

Truncation occurs at a character boundary, so the output is truncated to 平成1, which requires seven bytes.

If this were a fixed width field, these seven bytes would be padded with SPACE characters to produce ten bytes. Because the field is not fixed width, only seven bytes are output (excluding the length information which is always included in variable width data).

0E 45 8D 45 BA 0F 31
< > 1