16.20 - KATAKANAEBCDIC - Teradata Tools and Utilities

Teradata® Call-Level Interface Version 2 Reference for Mainframe-Attached Systems

prodname
Teradata Tools and Utilities
vrm_release
16.20
created_date
September 2019
category
Programming Reference
featnum
B035-2417-108K

The character set on the Teradata Database named KATAKANAEBCDIC5026_0I is intended as an extended EBCDIC character set consisting of both one and two-bytes per character. Architecturally, the EBCDIC encoding scheme consists of 256 possible values (codepoints) represented as hexadecimal values in the range X'00' to X'FF'

To support more than 256 codepoints, the EBCDIC encoding scheme is extended by defining the Shift-out control character to switch from one byte per character to two bytes per character until the Shift-in control character is encountered. The first byte of codepoints between the Shift-out and Shift-in control characters is always between X'41' and X'FE'. Currently, the second byte is also between X'41' and X'FE'. The X'4040' codepoint is defined as the Double-byte Space character. No double-byte control characters exist. The double-byte characters are not described.

Single-byte Teradata KATAKANAEBCDIC Codepage
  x0 x1 x2 x3 x4 x5 x6 x7 x8 x9 xA xB xC xD xE xF
0 x NUL SOH STX ETX   HT       y   VT FF CR SO SI
1 x ¢   ¬ / a   BS b CAN EM     IS4 IS3 IS2 IS1
2 x           LF ETB ESC           ENQ ACK BEL
3 x     SYN     c   EOT         ~ NAK    
4 x SP
£ · < ( + |
5 x &
 
  ! ¥ * ) ; ^
6 x - /     d e f g h i j ¸ % _ > ?
7 x p q r s t u v w x ´ : # @ ' = "
8 x  
 
9 x
   
A x [
 
Bx ]                  
C x { A B C D E F G H I z k l m n o
D x } J K L M N O P Q R            
E x $   S T U V W X Y Z            
F x 0 1 2 3 4 5 6 7 8 9            
                                 
Control character codepoints                    
Reserved codepoints                        
Katakana codepoints. Refer to the table titled Katakana Codepoint Assignments for KATAKANAEBCDIC for details.              

The Database will reject character data containing any single or double byte reserved codepoint and will not identify which invalid codepoint was present.

Katakana Codepoint Assignments for KATAKANAEBCDIC 
  IBM GCGID IBM description
codepoint Unicode code Unicode name
41 JQ700000 Katakana full stop
U+FF61 Halfwidth Ideographic Full Stop
42 JQ710000 Katakana left Bracket
U+FF62 Halfwidth Left Corner Bracket
43 JQ720000 Katakana right Bracket
U+FF63 Halfwidth Right Corner Bracket
44 JQ730000 Katakana comma
U+FF64 Halfwidth Ideographic Comma
45 JQ74000 Katakana conjunctive symbol
U+FF65 Halfwidth Katakana Middle Dot
46 JW500000 Katakana 'WO'
U+FF66 Halfwidth Katakana Letter 'WO'
47 JA010000 Katakana 'a'
U+FF67 Halfwidth Katakana Letter Small 'a'
48 JI010000 Katakana 'i'
U+FF68 Halfwidth Katakana Letter Small 'i'
49 JU010000 Katakana 'u'
U+FF69 Halfwidth Katakana Letter Small 'u'
51 JE010000 Katakana 'e'
U+FF6A Halfwidth Katakana Letter Small 'e'
52 JO010000 Katakana 'o'
U+FF6B Halfwidth Katakana Letter Small 'o'
53 JY110000 Katakana 'ya'
U+FF6C Halfwidth Katakana Letter Small 'ya'
54 JY310000 Katakana 'yu'
U+FF6D Halfwidth Katakana Letter Small 'yu'
55 JY510000 Katakana 'yo'
U+FF6E Halfwidth Katakana Letter Small 'yo'
56 JT310000 Katakana 'tu'/'tsu'
U+FF6F Halfwidth Katakana Letter Small 'tu'
58 JX700000 Katakana prolonged sound symbol
U+FF70 Halfwidth Katakana-Hiragana prolonged sound mark
81 JA000000 Katakana 'A'
U+FF71 Halfwidth Katakana Letter 'A'
82 JI000000 Katakana 'I'
U+FF72 Halfwidth Katakana Letter 'I'
83 JU000000 Katakana 'U'
U+FF73 Halfwidth Katakana Letter 'U'
84 JE000000 Katakana 'E'
U+FF74 Halfwidth Katakana Letter 'E'
85 JO000000 Katakana 'O'
U+FF75 Halfwidth Katakana Letter 'O'
86 JK100000 Katakana 'KA'
U+FF76 Halfwidth Katakana Letter 'KA'
87 JK200000 Katakana 'KI'
U+FF77 Halfwidth Katakana Letter 'KI'
88 JK300000 Katakana 'KU'
U+FF78 Halfwidth Katakana Letter 'KU'
89 JK400000 Katakana 'KE'
U+FF79 Halfwidth Katakana Letter 'KE'
8A JK500000 Katakana 'KO'
U+FF7A Halfwidth Katakana Letter 'KO'
8C JS100000 Katakana 'SA'
U+FF7B Halfwidth Katakana Letter 'SA'
8D JS200000 Katakana 'SI'/'SHI'
U+FF7C Halfwidth Katakana Letter 'SI'
8E JS300000 Katakana 'SU'
U+FF7D Halfwidth Katakana Letter 'SU'
8F JS400000 Katakana 'SE'
U+FF7E Halfwidth Katakana Letter 'SE'
90 JS500000 Katakana 'SO'
U+FF7F Halfwidth Katakana Letter 'SO'
91 JT100000 Katakana 'TA'
U+FF80 Halfwidth Katakana Letter 'TA'
92 JT200000 Katakana 'TI'/'CHI'
U+FF81 Halfwidth Katakana Letter 'TI'
93 JT300000 Katakana 'TU'/'TSU'
U+FF82 Halfwidth Katakana Letter 'TU'
94 JT400000 Katakana 'TE'
U+FF83 Halfwidth Katakana Letter 'TE'
95 JT500000 Katakana 'TO'
U+FF84 Halfwidth Katakana Letter 'TO'
96 JN100000 Katakana 'NA'
U+FF85 Halfwidth Katakana Letter 'NA'
97 JN200000 Katakana 'NI'
U+FF86 Halfwidth Katakana Letter 'NI'
98 JN300000 Katakana 'NU'
U+FF87 Halfwidth Katakana Letter 'NU'
99 JN400000 Katakana 'NE'
U+FF88 Halfwidth Katakana Letter 'NE'
9A JN500000 Katakana 'NO'
U+FF89 Halfwidth Katakana Letter 'NO'
9D JH100000 Katakana 'HA'
U+FF8A Halfwidth Katakana Letter 'HA'
9E JH200000 Katakana 'HI'
U+FF8B Halfwidth Katakana Letter 'HI'
9F JH300000 Katakana 'HU'/'FU'
U+FF8C Halfwidth Katakana Letter 'HU'
A2 JH400000 Katakana 'HE'
U+FF8D Halfwidth Katakana Letter 'HE'
A3 JH500000 Katakana 'HO'
U+FF8E Halfwidth Katakana Letter 'HO'
A4 JM100000 Katakana 'MA'
U+FF8F Halfwidth Katakana Letter 'MA'
A5 JM200000 Katakana 'MI'
U+FF90 Halfwidth Katakana Letter 'MI'
A6 JM300000 Katakana 'MU'
U+FF91 Halfwidth Katakana Letter 'MU'
A7 JM400000 Katakana 'ME'
U+FF92 Halfwidth Katakana Letter 'ME'
A8 JM500000 Katakana 'MO'
U+FF93 Halfwidth Katakana Letter 'MO'
A9 JY100000 Katakana 'YA'
U+FF94 Halfwidth Katakana Letter 'YA'
AA JY300000 Katakana 'YU'
U+FF95 Halfwidth Katakana Letter 'YU'
AC JY500000 Katakana 'YO'
U+FF96 Halfwidth Katakana Letter 'YO'
AD JR100000 Katakana 'RA'
U+FF97 Halfwidth Katakana Letter 'RA'
AE JR200000 Katakana 'RI'
U+FF98 Halfwidth Katakana Letter 'RI'
AF JR300000 Katakana 'RU'
U+FF99 Halfwidth Katakana Letter 'RU'
BA JR400000 Katakana 'RE'
U+FF9A Halfwidth Katakana Letter 'RE'
BB JR500000 Katakana 'RO'
U+FF9B Halfwidth Katakana Letter 'RO'
BC JW100000 Katakana 'WA'
U+FF9C Halfwidth Katakana Letter 'WA'
BD JN000000 Katakana 'N'
U+FF9D Halfwidth Katakana Letter 'N'
BE JX710000 Voiced sound symbol
U+FF9E Halfwidth Katakana Voiced sound Mark
BF JX720000 Semi-voiced sound symbol
U+FF9F Halfwidth Katakana Semi-voiced sound Mark

This is not a well-formed EBCDIC encoding because graphic characters appear in the range reserved for control characters, all common control characters are not present, and the codepoint reserved for the Eight Ones character is not included.

The Database intentionally returns lower case English alphabetic characters as their upper-case equivalents. That is, codepoints X'14', X'17', X'35', X'64' through X'6A', X'CB' through X'CF', X'70' through X'78', X'09', and X'CA' are returned as X'C1' through X'C9', X'D1' through X'D9', and X'E2' through X'E9', respectively.

The Database defines the Overline character for KANJIEBCDIC5026_0I, KANJIEBCDIC5035_0I, KATAKANAEBCDIC, and SCHEBCDIC935_2IJ differently than for the other character sets. So if sent to the Database using a character set in one group but received from the Database using a character set in the other group, the codepoint will change.

No special processing is performed by the Database for control characters, except for Shift Out and Shift In, which switch to and from double-byte codepoints.