The Teradata Universal Coded Character Set Transformation Format (UTF8) client character set supports UTF8, a standard way of encoding Unicode character data that is optimized for backward compatibility with ASCII. This character set is usable for all languages. In Teradata UTF8, a character can consist of one to three bytes.
The UTF8 client character set is permanently enabled for use in Analytics Database.
| IF a byte in a UTF8 string is … | THEN it … |
|---|---|
| less than 0x80 | represents the same character defined by standard ASCII. |
| greater than or equal to 0x80 | is part of a multibyte sequence and is not a standard ASCII character. |
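
The byte rule in the table can be illustrated with a minimal sketch. It uses Python's built-in standard UTF-8 codec on the client side, not any Teradata API. The function name `classify_utf8_bytes` and the sample string are hypothetical and chosen only for illustration. The sample is limited to characters that encode in one to three bytes, matching the range described for Teradata UTF8.

```python
from typing import List, Tuple


def classify_utf8_bytes(data: bytes) -> List[Tuple[int, str]]:
    """Label each byte using the rule from the table above.

    Bytes below 0x80 are plain ASCII characters; bytes at or above 0x80
    belong to a multibyte sequence. (Hypothetical helper for illustration.)
    """
    return [(b, "ascii" if b < 0x80 else "multibyte") for b in data]


# Sample text: 'a' encodes to 1 byte, 'é' to 2 bytes, '€' to 3 bytes.
sample = "a\u00e9\u20ac".encode("utf-8")
labels = classify_utf8_bytes(sample)

for value, kind in labels:
    print(f"0x{value:02X} {kind}")
```

Only the first byte (0x61, the letter `a`) falls below 0x80. The remaining five bytes belong to the two-byte and three-byte sequences, so none of them can be mistaken for an ASCII character.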