7.00.02 - TextTagging Arguments - Aster Analytics

Teradata Aster® Analytics Foundation User Guide Update 2

Product
Aster Analytics
Release Number
7.00.02
Release Date
September 2017
Content Type
Programming Reference
User Guide
Publication ID
B700-1022-700K
Language
English (United States)
Language
[Optional] Specifies the language of the input text:
  • 'en': English (Default)
  • 'zh_cn': Simplified Chinese
  • 'zh_tw': Traditional Chinese

If Tokenize specifies 'true', then the function uses the value of Language to create the word tokenizer.

Rules
[Required if you do not specify a rules table, disallowed otherwise.] Specifies the tag names and tagging rules. For information about defining tagging rules, see Defining Tagging Rules.
Tokenize
[Optional] Specifies whether the function tokenizes the input text before evaluating the rules, and whether it tokenizes the text string parameter of a rule definition when parsing the rule. If you specify 'true', then you must also specify the Language argument. Default: 'false'.
OutputByTag
[Optional] Specifies whether the function outputs a separate tuple for each matched tag when a text document matches multiple tags. Default: 'false' (each output tuple represents one document, and its matched tags are listed together in the output column tag).
TagDelimiter
[Optional] Specifies the delimiter string that separates multiple tags in the output column tag when OutputByTag has the value 'false'. Default: ',' (comma). If OutputByTag has the value 'true', specifying this argument causes an error.
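The interaction between OutputByTag and TagDelimiter can be sketched in Python. This is only an illustrative model of the output shape described above, not the function's implementation. The helper name shape_output, its parameter names, and the sample document IDs and tags are all hypothetical, and documents that match no tag are not modeled.

```python
from typing import Dict, List, Optional, Tuple


def shape_output(matches: Dict[str, List[str]],
                 output_by_tag: bool = False,
                 tag_delimiter: Optional[str] = None) -> List[Tuple[str, str]]:
    """Model the (document, tag) rows for a mapping of document -> matched tags.

    Hypothetical helper: it mimics the documented behavior of OutputByTag and
    TagDelimiter, and nothing else about TextTagging.
    """
    if output_by_tag and tag_delimiter is not None:
        # Documented constraint: TagDelimiter with OutputByTag 'true' is an error.
        raise ValueError("TagDelimiter cannot be specified when OutputByTag is 'true'")

    delimiter = "," if tag_delimiter is None else tag_delimiter  # documented default
    rows: List[Tuple[str, str]] = []
    for doc_id, tags in matches.items():
        if output_by_tag:
            # One row per matched tag.
            rows.extend((doc_id, tag) for tag in tags)
        else:
            # One row per document, tags joined in a single 'tag' value.
            rows.append((doc_id, delimiter.join(tags)))
    return rows


# Hypothetical sample data.
sample = {"doc1": ["sports", "finance"], "doc2": ["finance"]}
print(shape_output(sample))                      # one row per document
print(shape_output(sample, output_by_tag=True))  # one row per matched tag
```

With the default settings, each document yields one row whose tag value joins all matched tags. With output_by_tag=True, each matched tag yields its own row.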
Accumulate
[Optional] Specifies the names of text table columns to copy to the output table.
Do not use the name 'tag' for an accumulate_column, because the function uses that name for the output table column that contains the tags.
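The constraints stated across these arguments can be collected into a small pre-flight check. The Python sketch below is a hypothetical client-side helper, not part of Aster Analytics. The function name check_arguments and its parameters are assumptions made for illustration, and treating the column name 'tag' as case-insensitive is likewise an assumption.

```python
from typing import List, Optional, Sequence

# Language codes listed for the Language argument.
SUPPORTED_LANGUAGES = {"en", "zh_cn", "zh_tw"}


def check_arguments(has_rules_table: bool,
                    rules: Optional[Sequence[str]] = None,
                    language: Optional[str] = None,
                    tokenize: bool = False,
                    output_by_tag: bool = False,
                    tag_delimiter: Optional[str] = None,
                    accumulate: Sequence[str] = ()) -> List[str]:
    """Return a list of violated constraints (empty if none are violated)."""
    problems: List[str] = []

    # Rules: required without a rules table, disallowed with one.
    if has_rules_table and rules:
        problems.append("Rules is disallowed when a rules table is specified")
    if not has_rules_table and not rules:
        problems.append("Rules is required when no rules table is specified")

    # Language: must be one of the listed codes when given.
    if language is not None and language not in SUPPORTED_LANGUAGES:
        problems.append(f"unsupported Language value: {language!r}")

    # Tokenize 'true' requires an explicit Language.
    if tokenize and language is None:
        problems.append("Tokenize 'true' requires the Language argument")

    # TagDelimiter is an error when OutputByTag is 'true'.
    if output_by_tag and tag_delimiter is not None:
        problems.append("TagDelimiter cannot be used with OutputByTag 'true'")

    # 'tag' is reserved for the output column (case-insensitivity is assumed).
    if any(name.lower() == "tag" for name in accumulate):
        problems.append("Accumulate must not include a column named 'tag'")

    return problems


# Hypothetical call: tokenization requested without a language.
print(check_arguments(has_rules_table=True, tokenize=True))
```

Returning a list of messages rather than raising on the first failure lets a caller report every violated constraint at once.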