Character Set Files | Teradata Vantage NewSQL Engine - 17.10 - Character Set Files - Advanced SQL Engine - Teradata Database

Teradata Vantage™ - Advanced SQL Engine International Character Set Support

Product
Advanced SQL Engine
Teradata Database
Release Number
17.10
Release Date
July 2021
Content Type
Configuration
User Guide
Publication ID
B035-1125-171K
Language
English (United States)
The text files listed in the tables provide the following information:
  • Valid characters that you can use in object names
  • Mappings between character sets
  • Character set collations
Download these files as a zip file:
  1. Access Teradata Vantage™ - Advanced SQL Engine International Character Set Support, B035-1125 at https://docs.teradata.com/.
  2. In the left pane, download the zip file from the download tab .

Valid Characters in Object Names

Title File Name Publication ID
Unicode Characters Allowed in Object Names on Systems with Extended Object Naming UOBJNEXT.txt B035-1200
Unicode in Object Names on Japanese Language Support Systems UOBJNJAP.txt B035-1177
Unicode in Object Names on Standard Language Support Systems UOBJNSTD.txt B035-1176

Character Set Mappings and Collations

Title Filename Description Publication ID
ARABIC1256_6A0 to Unicode A6A0SUCD.txt ARABIC1256 to Unicode B035-1165
CYRILLIC1251_2A0 to Unicode C2A0SUCD.txt CYRILLIC1251 to Unicode B035-1166
HANGUL949_7R0 Multibyte to Unicode H7R0MUCD.txt HANGUL949 (multibyte character portion) to Unicode B035-1170
HANGUL949_7R0 Single Byte to Unicode H7R0SUCD.txt HANGUL949 (single-byte character portion) to Unicode B035-1169
HANGULEBCDIC933_1II Multibyte to Unicode H1IMUNCD.txt Hangul EBCDIC (IBM CCSID 933) multibyte (IBM CP 834) to Unicode B035-1135
HANGULEBCDIC933_1II Single Byte to Unicode H1ISUNCD.txt Hangul EBCDIC (IBM CCSID 933) single-byte (IBM CP 833) to Unicode B035-1134
HANGULKSC5601_2R4 Multibyte to Unicode H2RMUNCD.txt Hangul (mixed KS Roman/KS C 5601) multibyte (KS C 5601) to Unicode B035-1137
HANGULKSC5601_2R4 Single Byte to Unicode H2RSUNCD.txt Hangul (mixed KS Roman/KS C 5601) single-byte (KS Roman) to Unicode B035-1136
HEBREW1255_5A0 to Unicode H5A0SUCD.txt HEBREW1255 to Unicode B035-1164
JIS_COLL Case-Blind Collation JISCOLBL.txt JIS_COLL Case-Blind collation B035-1061
JIS_COLL Case-Specific Collation JIS_COLL.txt JIS_COLL Case-Specific collation B035-1060
KANJI932_1S0 Multibyte to Unicode K1S0MUCD.txt KANJI932 (multibyte character portion) to Unicode B035-1175
KANJI932_1S0 Single Byte to Unicode K1S0SUCD.txt KANJI932 (single-byte character portion) to Unicode B035-1174
KanjiEBCDIC Multibyte (SO/SI) to Unicode SOSIUNCD.txt KanjiEBCDIC Multibyte (Shift-Out/Shift-In) to Unicode B035-1055
KanjiEUC Code Set 1 to Unicode EUC1UNCD.txt KanjiEUC Code Set 1 (JIS-x0208) to Unicode B035-1115
KanjiEUC Code Set 2 to Unicode EUC2UNCD.txt KanjiEUC Code Set 2 (JIS-x0201 Katakana) to Unicode B035-1139
KanjiEUC Code Set 3 to Unicode EUC3UNCD.txt KanjiEUC Code Set 3 (JIS-x0212) to Unicode B035-1116
KanjiShiftJIS to KanjiShiftJIS Multibyte SJISSJIS.txt KanjiSJIS to KanjiSJIS multibyte characters B035-1053
KanjiShiftJIS to Unicode Multibyte SJISUNCD.txt

KanjiSJIS to multibyte Unicode

B035-1054
LATIN1250_1A0 to Unicode L1A0SUCD.txt LATIN1250 to Unicode B035-1168
LATIN1252_3A0 to Unicode L3A0SUCD.txt LATIN1252 to Unicode B035-1163
LATIN1254 to Unicode L7A0SUCD.txt LATIN1254 to Unicode B035-1171
LATIN1258_8A0 to Unicode L8A0SUCD.txt LATIN1258 to Unicode B035-1173
LATIN Server Character Set latin_server.txt Supported characters in the NewSQL Engine LATIN server character set B035-1207
Multinational Case-Blind Default Collation blinddef.txt Default for Multinational Case-Blind collation B035-1050
Multinational Case-Specific Default Collation multnatl.txt Default for Multinational Case-Specific collation B035-1062
SCHEBCDIC935_2IJ Multibyte to Unicode C2IMUNCD.txt Simplified Chinese EBCDIC (IBM CCSID 935) multibyte (IBM CP 837) to Unicode B035-1131
SCHEBCDIC935_2IJ Single Byte to Unicode C2ISUNCD.txt Simplified Chinese EBCDIC (IBM CCSID 935) single-byte (IBM CP 836) to Unicode B035-1130
SCHGB2312_1T0 Code Set 0 to Unicode C1T0UNCD.txt Simplified Chinese (mixed ASCII/GB 2312-1980) Code Set 0 (ASCII) to Unicode B035-1126
SCHGB2312_1T0 Code Set 1 to Unicode C1T1UNCD.txt Simplified Chinese (mixed ASCII/GB 2312-1980) Code Set 1 (GB 2312-1980) to Unicode B035-1127
SCHINESE936_6R0 Multibyte to Unicode S6R0MUCD.txt SCHINESE936 (multibyte character portion) to Unicode B035-1162
SCHINESE936_6R0 Single Byte to Unicode S6R0SUCD.txt SCHINESE936 (single-byte character portion) to Unicode B035-1161
TCHBIG5_1R0 Multibyte to Unicode C1RMUNCD.txt Traditional Chinese (Big5) multibyte to Unicode B035-1129
TCHBIG5_1R0 Single Byte to Unicode C1RSUNCD.txt Traditional Chinese (Big5) single-byte (ASCII) to Unicode B035-1128
TCHEBCDIC937_3IB Multibyte to Unicode C3IMUNCD.txt Traditional Chinese EBCDIC (IBM CCSID 937) multibyte (IBM CP 835) to Unicode B035-1133
TCHEBCDIC937_3IB Single Byte to Unicode C3ISUNCD.txt Traditional Chinese EBCDIC (IBM CCSID 937) single-byte (IBM CP 037) to Unicode B035-1132
TCHINESE950_8R0 Multibyte to Unicode T8R0MUCD.txt TCHINESE950 (multibyte character portion) to Unicode B035-1178
TCHINESE950_8R0 Single Byte to Unicode T8R0SUCD.txt TCHINESE950 (single-byte character portion) to Unicode B035-1172
THAI874_4A0 Single Byte to Unicode T4A0SUCD.txt THAI874 to Unicode B035-1167
UNICODE Server Character Set UNCDUNCD.txt Supported characters in the NewSQL Engine UNICODE server character set B035-1056
Unicode to KanjiEBCDIC Multibyte (SO/SI) UNCDSOSI.txt Unicode to KanjiEBCDIC multibyte (Shift-Out/Shift-In) B035-1104
Unicode to KanjiEUC Code Sets 1, 2, and 3 UNCDE123.txt Unicode to KanjiEUC Code Set 1,2,3 (JIS-x0208) as Unix Process Code (UPC) B035-1117
Unicode to KanjiSJIS Multibyte UNCDSJIS.txt Unicode to KanjiSJIS multibyte B035-1058
UNICODE to UNICODE_Fullwidth UNCDH2F.txt Halfwidth Unicode to fullwidth Unicode B035-1202
UNICODE to UNICODE_Halfwidth UNCDF2H.txt Fullwidth Unicode to halfwidth Unicode B035-1201
UNICODE to VARGRAPHIC UNCDVARG.txt
  • Halfwidth letters of Unicode to the fullwidth letters of Unicode
  • SPACE (0x0020) to the IDEOGRAPHIC SPACE (0x3000)
  • Valid characters of graphic
B035-1057