1.0 - 8.00 - TextTagger Arguments - Teradata Vantage

Teradata® Vantage Machine Learning Engine Analytic Function Reference

Product: Teradata Vantage
Release Date: May 2019
Content Type: Programming Reference
Language: English (United States)
InputLanguage
[Optional] Specify the language of the input text:
Option     Description
'en'       (Default) English
'zh_CN'    Simplified Chinese
'zh_TW'    Traditional Chinese
[Required if you do not specify a rules table, disallowed otherwise.] Specify the tag names and tagging rules. For information about defining tagging rules, see Defining Tagging Rules.
[Optional] Specify whether the function tokenizes the input text before evaluating the rules and tokenizes the text string parameter in the rule definition when parsing a rule.

If you specify 'true', then you must also specify the InputLanguage argument. The function uses the value of InputLanguage to create the word tokenizer.

Default: 'false'
OutputByTag
[Optional] Specify whether the function outputs a separate tuple for each tag when a text document matches multiple tags.
Default: 'false' (The function outputs one tuple per document and lists the matched tags in the output column tag.)
[Optional] Specify the delimiter, a string, that separates multiple tags in the output column tag when OutputByTag is 'false'. If OutputByTag is 'true', specifying this argument causes an error.
Default: ',' (comma)
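
The interaction between OutputByTag and the tag delimiter can be illustrated with a small Python model. This is only a sketch of the output shape described above, not the function's implementation; the name shape_output, the document IDs, and the tag names are hypothetical.

```python
from typing import List, Optional, Tuple


def shape_output(doc_id: int,
                 matched_tags: List[str],
                 output_by_tag: bool = False,
                 delimiter: Optional[str] = None) -> List[Tuple[int, str]]:
    """Model the (document, tag) rows produced for one tagged document.

    Hypothetical helper: it mirrors the documented behavior only and
    does not model documents that match no tags.
    """
    if output_by_tag:
        # The delimiter is disallowed when each tag gets its own row.
        if delimiter is not None:
            raise ValueError("delimiter cannot be combined with output_by_tag")
        return [(doc_id, tag) for tag in matched_tags]
    # One row per document; the default delimiter is a comma.
    sep = "," if delimiter is None else delimiter
    return [(doc_id, sep.join(matched_tags))]
```

For example, shape_output(1, ["sports", "politics"]) yields one row with the tag value "sports,politics", while passing output_by_tag=True yields two rows, one per tag.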
[Optional] Specify the names of text table columns to copy to the output table.
Do not use the name 'tag' for an accumulate_column, because the function uses that name for the output table column that contains the tags.
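
The constraints among these arguments can be summarized as a pre-flight check. The following Python sketch is illustrative only: check_arguments and the dictionary keys "rules", "tokenize", "tag_delimiter", and "accumulate" are hypothetical names chosen for this example (only InputLanguage and OutputByTag are argument names taken from the text), Boolean values stand in for the 'true'/'false' strings, and the case-insensitive comparison against 'tag' is an assumption.

```python
from typing import Any, Dict, List


def check_arguments(args: Dict[str, Any], has_rules_table: bool) -> List[str]:
    """Return a list of constraint violations (empty if none are found)."""
    problems: List[str] = []

    # Rules come from exactly one source: the argument or a rules table.
    if ("rules" in args) == has_rules_table:
        problems.append("specify tagging rules as an argument or as a "
                        "rules table, but not both and not neither")

    # Tokenizing requires InputLanguage, which selects the word tokenizer.
    if args.get("tokenize", False) and "InputLanguage" not in args:
        problems.append("tokenizing requires InputLanguage")

    # A delimiter is meaningful only when tags share one output row.
    if args.get("OutputByTag", False) and "tag_delimiter" in args:
        problems.append("a delimiter cannot be used with OutputByTag 'true'")

    # 'tag' is reserved for the output column that holds the tags.
    # Assumption: the comparison ignores case.
    if any(col.lower() == "tag" for col in args.get("accumulate", [])):
        problems.append("'tag' cannot be used as an accumulate column name")

    return problems
```

Calling check_arguments({"tokenize": True}, has_rules_table=True) reports the missing InputLanguage, while a call that supplies only the rules argument and no rules table reports nothing.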