16.20 - SCHEBCDIC935_2IJ - Teradata Tools and Utilities

Teradata® Call-Level Interface Version 2 Reference for Mainframe-Attached Systems

prodname: Teradata Tools and Utilities
vrm_release: 16.20
created_date: September 2019
category: Programming Reference
featnum: B035-2417-108K

The Teradata Database character set named SCHEBCDIC935_2IJ is intended as an extended EBCDIC character set in which a character occupies either one byte or two bytes. Architecturally, the EBCDIC encoding scheme consists of 256 possible values (codepoints), represented as hexadecimal values in the range X'00' to X'FF'.

To support more than 256 codepoints, the EBCDIC encoding scheme is extended by defining the Shift-out control character to switch from one byte per character to two bytes per character until the Shift-in control character is encountered. The first byte of a codepoint between the Shift-out and Shift-in control characters is always between X'41' and X'FE'. Currently, the second byte is also between X'41' and X'FE'. In addition, the X'4040' codepoint is defined as the Double-byte Space character. No double-byte control characters exist. The double-byte characters are not described here.
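The switching rule above can be sketched in a few lines of Python. This is a minimal illustration, not Teradata code: the helper names `split_segments` and `is_valid_double` are hypothetical, and the Shift-out and Shift-in values (X'0E' and X'0F') are taken from the SO and SI cells of the single-byte chart in this section.

```python
SO = 0x0E  # Shift-out, assumed from the SO cell (row 0, column xE) of the chart
SI = 0x0F  # Shift-in, assumed from the SI cell (row 0, column xF) of the chart


def split_segments(data: bytes):
    """Split data into ("single" | "double", bytes) runs, dropping SO/SI."""
    segments = []
    double = False
    start = 0
    i = 0
    while i < len(data):
        b = data[i]
        if not double:
            if b == SO:
                if i > start:
                    segments.append(("single", data[start:i]))
                double = True
                start = i + 1
            i += 1
        elif b == SI:
            if i > start:
                segments.append(("double", data[start:i]))
            double = False
            start = i + 1
            i += 1
        else:
            i += 2  # step over one two-byte codepoint
    if double:
        raise ValueError("unterminated or misaligned double-byte run")
    if start < len(data):
        segments.append(("single", data[start:]))
    return segments


def is_valid_double(pair: bytes) -> bool:
    """Check one two-byte codepoint against the ranges stated above."""
    if pair == b"\x40\x40":  # Double-byte Space
        return True
    return len(pair) == 2 and all(0x41 <= b <= 0xFE for b in pair)
```

For example, `split_segments(b"\xC1\x0E\x41\x41\x40\x40\x0F\xC2")` yields a single-byte run, a four-byte double-byte run, and a final single-byte run. The range check only tests the byte ranges given in the text; it does not know which double-byte codepoints are actually assigned.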

Single-byte Teradata SCHEBCDIC935_2IJ Codepage
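     x0  x1  x2  x3  x4  x5  x6  x7  x8  x9  xA  xB  xC  xD  xE  xF
0x   NUL SOH STX ETX ST  HT  SSA DEL EPA RI  SS2 VT  FF  CR  SO  SI
1x   DLE DC1 DC2 DC3 OSC NEL BS  ESA CAN EM  PU2 SS3 IS4 IS3 IS2 IS1
2x   UC1 UC2 BPH NBH UC3 LF  ETB ESC HTS HTJ VTS PLD PLU ENQ ACK BEL
3x   DCS PU1 SYN STS CCH MW  SPA EOT SOS UC4 SCI CSI DC4 NAK PM  --
4x   SP  --  --  --  --  --  --  --  --  --  £   ·   <   (   +   |
5x   &   --  --  --  --  --  --  --  --  --  !   ¥   *   )   ;   ¬
6x   -   /   --  --  --  --  --  --  --  --  ¦   ¸   %   _   >   ?
7x   --  --  --  --  --  --  --  --  --  `   :   #   @   '   =   "
8x   --  a   b   c   d   e   f   g   h   i   --  --  --  --  --  --
9x   --  j   k   l   m   n   o   p   q   r   --  --  --  --  --  --
Ax   ~   ‾   s   t   u   v   w   x   y   z   --  --  --  --  --  --
Bx   ^   --  \   --  --  --  --  --  --  --  [   ]   --  --  --  --
Cx   {   A   B   C   D   E   F   G   H   I   --  --  --  --  --  --
Dx   }   J   K   L   M   N   O   P   Q   R   --  --  --  --  --  --
Ex   $   --  S   T   U   V   W   X   Y   Z   --  --  --  --  --  --
Fx   0   1   2   3   4   5   6   7   8   9   --  --  --  --  --  APC

Control character codepoints: shown by mnemonic (X'00' to X'3E' and X'FF').
Reserved codepoints: shown as --.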

The Database rejects character data that contains any single-byte or double-byte reserved codepoint, and it does not identify which invalid codepoint was present.
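Because the Database does not report the offending codepoint, a client-side pre-check can help locate it before the data is sent. The following is a minimal sketch under stated assumptions: the name `find_reserved_single` and the `ASSIGNED` set are hypothetical, the set is transcribed by hand from the single-byte chart in this section with every empty cell treated as reserved, and X'A1' is assumed to hold the Overline character. Double-byte runs between Shift-out (X'0E') and Shift-in (X'0F') are skipped, because the double-byte codepoints are not described here.

```python
def _span(lo: int, hi: int):
    """Inclusive range of byte values."""
    return range(lo, hi + 1)


# Assumption: transcribed from the single-byte chart; empty cells are reserved.
ASSIGNED = set()
for lo, hi in [
    (0x00, 0x3E),                              # control characters
    (0x40, 0x40), (0x4A, 0x4F),
    (0x50, 0x50), (0x5A, 0x5F),
    (0x60, 0x61), (0x6A, 0x6F),
    (0x79, 0x7F),
    (0x81, 0x89), (0x91, 0x99),
    (0xA0, 0xA9),                              # X'A1' assumed to be Overline
    (0xB0, 0xB0), (0xB2, 0xB2), (0xBA, 0xBB),
    (0xC0, 0xC9), (0xD0, 0xD9),
    (0xE0, 0xE0), (0xE2, 0xE9),
    (0xF0, 0xF9), (0xFF, 0xFF),
]:
    ASSIGNED.update(_span(lo, hi))


def find_reserved_single(data: bytes):
    """Return (offset, byte) of the first reserved single-byte codepoint, or None."""
    double = False
    i = 0
    while i < len(data):
        b = data[i]
        if double:
            if b == 0x0F:      # Shift-in ends the double-byte run
                double = False
                i += 1
            else:
                i += 2         # skip one two-byte codepoint unchecked
            continue
        if b not in ASSIGNED:
            return (i, b)
        if b == 0x0E:          # Shift-out starts a double-byte run
            double = True
        i += 1
    return None
```

A call such as `find_reserved_single(b"\xC1\xC2\x41\xC3")` reports offset 2, since X'41' is an empty cell in the chart. The check is only as accurate as the transcribed set, so the set should be verified against the authoritative chart before relying on it.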

The graphic characters adhere to the standard definition for IBM code page 00836, but the control characters and the Eight Ones character do not: non-EBCDIC control characters appear in the range reserved for control characters, not all common control characters are present, and a non-EBCDIC control character replaces the Eight Ones character.

IBM GCGIDs differentiate the Yen (SC050000) from the Yuan (SC120000), and in this character set codepoint X'5B' is the Yuan. Unicode and the Database do not make this distinction and use U+00A5 for both.

The Database defines the Overline character for KANJIEBCDIC5026_0I, KANJIEBCDIC5035_0I, KATAKANAEBCDIC, and SCHEBCDIC935_2IJ differently than for the other character sets. As a result, if an Overline character is sent to the Database using a character set in one group but received from the Database using a character set in the other group, its codepoint will change.

No special processing is performed by the Database for control characters, except for Shift Out and Shift In, which switch to and from double-byte codepoints. The non-EBCDIC control characters Single Shift Two and Single Shift Three imply nothing about subsequent codepoints.