The Teradata Universal Coded Character Set Transformation Format (UTF8) client character set supports UTF8, a standard way of encoding Unicode character data that is optimized for backward compatibility with ASCII. This character set is usable for all languages. In Teradata UTF8, a character can consist of one to three bytes.
The UTF8 client character set is permanently enabled for use in Analytics Database.
|IF a byte in a UTF8 string is …||THEN it …|
|less than 0x80||represents the same character defined by standard ASCII.|
|greater than or equal to 0x80||is part of a multibyte sequence and is not a standard ASCII character.|