The character set on the Teradata Database named KATAKANAEBCDIC5026_0I is intended as an extended EBCDIC character set consisting of both one and two-bytes per character. Architecturally, the EBCDIC encoding scheme consists of 256 possible values (codepoints) represented as hexadecimal values in the range X'00' to X'FF'
To support more than 256 codepoints, the EBCDIC encoding scheme is extended by defining the Shift-out control character to switch from one byte per character to two bytes per character until the Shift-in control character is encountered. The first byte of codepoints between the Shift-out and Shift-in control characters is always between X'41' and X'FE'. Currently, the second byte is also between X'41' and X'FE'. The X'4040' codepoint is defined as the Double-byte Space character. No double-byte control characters exist. The double-byte characters are not described.
|
x0
|
x1
|
x2
|
x3
|
x4
|
x5
|
x6
|
x7
|
x8
|
x9
|
xA
|
xB
|
xC
|
xD
|
xE
|
xF
|
0x |
NUL
|
SOH
|
STX
|
ETX
|
|
HT
|
|
|
|
y
|
|
VT
|
FF
|
CR
|
SO
|
SI
|
1x |
¢
|
|
¬
|
/
|
a
|
|
BS
|
b
|
CAN
|
EM
|
|
|
IS4
|
IS3
|
IS2
|
IS1
|
2x |
|
|
|
|
|
LF
|
ETB
|
ESC
|
|
|
|
|
|
ENQ
|
ACK
|
BEL
|
3x |
|
|
SYN
|
|
|
c
|
|
EOT
|
|
|
|
|
~
|
NAK
|
|
|
4x |
SP
|
|
|
|
|
|
|
|
|
|
£
|
.
|
<
|
(
|
+
|
|
|
5x |
&
|
|
|
|
|
|
|
|
|
|
!
|
¥
|
*
|
)
|
;
|
^
|
6x |
-
|
/
|
|
|
d
|
e
|
f
|
g
|
h
|
i
|
j
|
,
|
%
|
_
|
>
|
?
|
7x |
p
|
q
|
r
|
s
|
t
|
u
|
v
|
w
|
x
|
´
|
:
|
#
|
@
|
'
|
=
|
"
|
8x |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9x |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Ax |
[
|
¯
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Bx
|
] |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Cx |
{
|
A
|
B
|
C
|
D
|
E
|
F
|
G
|
H
|
I
|
z
|
k
|
l
|
m
|
n
|
o
|
Dx |
}
|
J
|
K
|
L
|
M
|
N
|
O
|
P
|
Q
|
R
|
|
|
|
|
|
|
Ex |
$
|
|
S
|
T
|
U
|
V
|
W
|
X
|
Y
|
Z
|
|
|
|
|
|
|
Fx |
0
|
1
|
2
|
3
|
4
|
5
|
6
|
7
|
8
|
9
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Control character codepoints |
|||||||||||||||
|
Reserved codepoints |
|||||||||||||||
|
Katakana codepoints. Refer to Table 58 on page 403 for details. |
The server will reject character data containing any single or double byte reserved codepoint and will not identify which invalid codepoint was present.
codepoint |
IBM GCGID |
IBM description |
Unicode® code |
Unicode® name |
|
41 |
JQ700000 |
Katakana full stop |
U+FF61 |
Halfwidth Ideographic Full Stop |
|
42 |
JQ710000 |
Katakana left Bracket |
U+FF62 |
Halfwidth Left Corner Bracket |
|
43 |
JQ720000 |
Katakana right Bracket |
U+FF63 |
Halfwidth Right Corner Bracket |
|
44 |
JQ730000 |
Katakana comma |
U+FF64 |
Halfwidth Ideographic Comma |
|
45 |
JQ74000 |
Katakana conjunctive symbol |
U+FF65 |
Halfwidth Katakana Middle Dot |
|
46
|
JW500000 |
Katakana 'WO' |
U+FF66 |
Halfwidth Katakana Letter 'WO' |
|
47
|
JA010000 |
Katakana 'a' |
U+FF67 |
Halfwidth Katakana Letter Small 'a' |
|
48
|
JI010000 |
Katakana 'i' |
U+FF68 |
Halfwidth Katakana Letter Small 'i' |
|
49
|
JU010000 |
Katakana 'u' |
U+FF69 |
Halfwidth Katakana Letter Small 'u' |
|
51 |
JE010000 |
Katakana 'e' |
U+FF6A |
Halfwidth Katakana Letter Small 'e' |
|
52 |
JO010000 |
Katakana 'o' |
U+FF6B |
Halfwidth Katakana Letter Small 'o' |
|
53 |
JY110000 |
Katakana 'ya' |
U+FF6C |
Halfwidth Katakana Letter Small 'ya' |
|
54 |
JY310000 |
Katakana 'yu' |
U+FF6D |
Halfwidth Katakana Letter Small 'yu' |
|
55 |
JY510000 |
Katakana 'yo' |
U+FF6E |
Halfwidth Katakana Letter Small 'yo' |
|
56 |
JT310000 |
Katakana 'tu'/'tsu' |
U+FF6F |
Halfwidth Katakana Letter Small 'tu' |
|
58 |
JX700000 |
Katakana prolonged sound symbol |
U+FF70 |
Halfwidth Katakana-Hiragana prolonged sound mark |
|
81 |
JA000000 |
Katakana 'A' |
U+FF71 |
Halfwidth Katakana Letter 'A' |
|
82 |
JI000000 |
Katakana 'I' |
U+FF72 |
Halfwidth Katakana Letter 'I' |
|
83 |
JU000000 |
Katakana 'U' |
U+FF73 |
Halfwidth Katakana Letter 'U' |
|
84 |
JE000000 |
Katakana 'E' |
U+FF74 |
Halfwidth Katakana Letter 'E' |
|
85 |
JO000000 |
Katakana 'O' |
U+FF75 |
Halfwidth Katakana Letter 'O' |
|
86 |
JK100000 |
Katakana 'KA' |
U+FF76 |
Halfwidth Katakana Letter 'KA' |
|
87 |
JK200000 |
Katakana 'KI' |
U+FF77 |
Halfwidth Katakana Letter 'KI' |
|
88 |
JK300000 |
Katakana 'KU' |
U+FF78 |
Halfwidth Katakana Letter 'KU' |
|
89 |
JK400000 |
Katakana 'KE' |
U+FF79 |
Halfwidth Katakana Letter 'KE' |
|
8A |
JK500000 |
Katakana 'KO' |
U+FF7A |
Halfwidth Katakana Letter 'KO' |
|
8C |
JS100000 |
Katakana 'SA' |
U+FF7B |
Halfwidth Katakana Letter 'SA' |
|
8D |
JS200000 |
Katakana 'SI'/'SHI' |
U+FF7C |
Halfwidth Katakana Letter 'SI' |
|
8E |
JS300000 |
Katakana 'SU' |
U+FF7D |
Halfwidth Katakana Letter 'SU' |
|
8F |
JS400000 |
Katakana 'SE' |
U+FF7E |
Halfwidth Katakana Letter 'SE' |
|
90 |
JS500000 |
Katakana 'SO' |
U+FF7F |
Halfwidth Katakana Letter 'SO' |
|
91 |
JT100000 |
Katakana 'TA' |
U+FF80 |
Halfwidth Katakana Letter 'TA' |
|
92 |
JT200000 |
Katakana 'TI'/'CHI' |
U+FF81 |
Halfwidth Katakana Letter 'TI' |
|
93 |
JT300000 |
Katakana 'TU'/'TSU' |
U+FF82 |
Halfwidth Katakana Letter 'TU' |
|
94 |
JT400000 |
Katakana 'TE' |
U+FF83 |
Halfwidth Katakana Letter 'TE' |
|
95 |
JT500000 |
Katakana 'TO' |
U+FF84 |
Halfwidth Katakana Letter 'TO' |
|
96 |
JN100000 |
Katakana 'NA' |
U+FF85 |
Halfwidth Katakana Letter 'NA' |
|
97 |
JN200000 |
Katakana 'NI' |
U+FF86 |
Halfwidth Katakana Letter 'NI' |
|
98 |
JN300000 |
Katakana 'NU' |
U+FF87 |
Halfwidth Katakana Letter 'NU' |
|
99 |
JN400000 |
Katakana 'NE' |
U+FF88 |
Halfwidth Katakana Letter 'NE' |
|
9A |
JN500000 |
Katakana 'NO' |
U+FF89 |
Halfwidth Katakana Letter 'NO' |
|
9D |
JH100000 |
Katakana 'HA' |
U+FF8A |
Halfwidth Katakana Letter 'HA' |
|
9E |
JH200000 |
Katakana 'HI' |
U+FF8B |
Halfwidth Katakana Letter 'HI' |
|
9F |
JH300000 |
Katakana 'HU'/'FU' |
U+FF8C |
Halfwidth Katakana Letter 'HU' |
|
A2 |
JH400000 |
Katakana 'HE' |
U+FF8D |
Halfwidth Katakana Letter 'HE' |
|
A3 |
JH500000 |
Katakana 'HO' |
U+FF8E |
Halfwidth Katakana Letter 'HO' |
|
A4 |
JM100000 |
Katakana 'MA' |
U+FF8F |
Halfwidth Katakana Letter 'MA' |
|
A5 |
JM200000 |
Katakana 'MI' |
U+FF90 |
Halfwidth Katakana Letter 'MI' |
|
A6 |
JM300000 |
Katakana 'MU' |
U+FF91 |
Halfwidth Katakana Letter 'MU' |
|
A7 |
JM400000 |
Katakana 'ME' |
U+FF92 |
Halfwidth Katakana Letter 'ME' |
|
A8 |
JM500000 |
Katakana 'MO' |
U+FF93 |
Halfwidth Katakana Letter 'MO' |
|
A9 |
JY100000 |
Katakana 'YA' |
U+FF94 |
Halfwidth Katakana Letter 'YA' |
|
AA |
JY300000 |
Katakana 'YU' |
U+FF95 |
Halfwidth Katakana Letter 'YU' |
|
AC |
JY500000 |
Katakana 'YO' |
U+FF96 |
Halfwidth Katakana Letter 'YO' |
|
AD |
JR100000 |
Katakana 'RA' |
U+FF97 |
Halfwidth Katakana Letter 'RA' |
|
AE |
JR200000 |
Katakana 'RI' |
U+FF98 |
Halfwidth Katakana Letter 'RI' |
|
AF |
JR300000 |
Katakana 'RU' |
U+FF99 |
Halfwidth Katakana Letter 'RU' |
|
BA |
JR400000 |
Katakana 'RE' |
U+FF9A |
Halfwidth Katakana Letter 'RE' |
|
BB |
JR500000 |
Katakana 'RO' |
U+FF9B |
Halfwidth Katakana Letter 'RO' |
|
BC |
JW100000 |
Katakana 'WA' |
U+FF9C |
Halfwidth Katakana Letter 'WA' |
|
BD |
JN000000 |
Katakana 'N' |
U+FF9D |
Halfwidth Katakana Letter 'N' |
|
BE |
JX710000 |
Voiced sound symbol |
U+FF9E |
Halfwidth Katakana Voiced sound Mark |
|
BF |
JX720000 |
Semi-voiced sound symbol |
U+FF9F |
Halfwidth Katakana Semi-voiced sound Mark |
This is not a well-formed EBCDIC encoding because graphic characters appear in the range reserved for control characters, all common control characters are not present, and the codepoint reserved for the Eight Ones character is not included.
The server intentionally returns lower case English alphabetic characters as their upper-case equivalents. That is, codepoints X'14', X'17', X'35', X'64' through X'6A', X'CB' through X'CF', X'70' through X'78', X'09', and X'CA' are returned as X'C1' through X'C9', X'D1' through X'D9', and X'E2' through X'E9', respectively.
The server defines the Overline character for KANJIEBCDIC5026_0I, KANJIEBCDIC5035_0I, KATAKANAEBCDIC, and SCHEBCDIC935_2IJ differently than for the other character sets. So if sent to the server using a character set in one group but received from the server using a character set in the other group, the codepoint will change.
No special processing is performed by the server for control characters, except for Shift Out and Shift In, which switch to and from double-byte codepoints.