TextTagging Arguments - Aster Analytics

Teradata Aster® Analytics Foundation User Guide Update 2

Product: Aster Analytics
Release Number: 7.00.02
Published: September 2017
Language: English (United States)
Last Update: 2018-04-17
dita:id: B700-1022
lifecycle: previous
Product Category: Software

Language
[Optional] Specifies the language of the input text:
  • 'en': English (Default)
  • 'zh_cn': Simplified Chinese
  • 'zh_tw': Traditional Chinese

If Tokenize specifies 'true', then the function uses the value of Language to create the word tokenizer.

Rules
[Required if you do not specify a rules table, disallowed otherwise.] Specifies the tag names and tagging rules. For information about defining tagging rules, see Defining Tagging Rules.
Tokenize
[Optional] Specifies whether the function tokenizes the input text before evaluating the rules, and whether it tokenizes the text string parameter in a rule definition when parsing the rule. If you specify 'true', then you must also specify the Language argument. Default: 'false'.
OutputByTag
[Optional] Specifies whether the function outputs a separate tuple for each tag when a text document matches multiple tags. Default: 'false' (one tuple in the output stands for one document, and the matched tags are listed in the output column tag).
TagDelimiter
[Optional] Specifies the delimiter, a string, that separates multiple tags in the output column tag if OutputByTag has the value 'false'. Default: ',' (comma). If OutputByTag has the value 'true', specifying this argument causes an error.
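
The interaction between OutputByTag and TagDelimiter can be pictured with a small Python sketch. This is not the function's implementation, only an illustration of the output shapes described above. The helper name format_output and the (doc_id, tags) input layout are hypothetical, and the input is assumed to contain only documents that matched at least one tag.

```python
def format_output(matches, output_by_tag=False, tag_delimiter=None):
    """Illustrative only: shape (doc_id, [tags]) pairs into output rows.

    `matches` is a hypothetical list of (doc_id, list_of_tags) pairs,
    each with at least one tag. Rows are returned as (doc_id, tag) tuples.
    """
    # Per the argument description, TagDelimiter is an error
    # when OutputByTag is 'true'.
    if output_by_tag and tag_delimiter is not None:
        raise ValueError("TagDelimiter cannot be specified when OutputByTag is 'true'")

    rows = []
    for doc_id, tags in matches:
        if output_by_tag:
            # One output row per matched tag.
            rows.extend((doc_id, tag) for tag in tags)
        else:
            # One output row per document; tags joined by the delimiter,
            # which defaults to a comma.
            delimiter = "," if tag_delimiter is None else tag_delimiter
            rows.append((doc_id, delimiter.join(tags)))
    return rows


# Hypothetical matches: document 1 matched two tags, document 2 matched one.
matches = [(1, ["flood", "storm"]), (2, ["storm"])]
default_rows = format_output(matches)
piped_rows = format_output(matches, tag_delimiter="|")
per_tag_rows = format_output(matches, output_by_tag=True)
```

With the default settings, document 1 yields a single row whose tag value is the joined string; with OutputByTag set to 'true', it yields two rows, one per tag.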
Accumulate
[Optional] Specifies the names of text table columns to copy to the output table.
Do not use the name 'tag' for an accumulate_column, because the function uses that name for the output table column that contains the tags.
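
The constraints stated across these arguments can be collected into a pre-flight check before a query is built. The following Python sketch is an assumption-laden illustration, not part of Aster Analytics: the function name check_arguments, its parameters, and the message texts are all hypothetical, and the comparison against the column name 'tag' is an exact string match.

```python
SUPPORTED_LANGUAGES = {"en", "zh_cn", "zh_tw"}


def check_arguments(rules=None, has_rules_table=False, language=None,
                    tokenize=False, output_by_tag=False,
                    tag_delimiter=None, accumulate=()):
    """Return a list of messages for violated constraints (empty if none).

    Hypothetical helper; parameters mirror the documented arguments,
    with None meaning "argument not specified".
    """
    problems = []

    # Rules is required without a rules table and disallowed with one.
    if rules is None and not has_rules_table:
        problems.append("Rules is required when no rules table is supplied")
    if rules is not None and has_rules_table:
        problems.append("Rules is not allowed together with a rules table")

    # Language accepts only the listed values.
    if language is not None and language not in SUPPORTED_LANGUAGES:
        problems.append("Language must be one of 'en', 'zh_cn', 'zh_tw'")

    # Tokenize 'true' requires Language to be specified.
    if tokenize and language is None:
        problems.append("Tokenize 'true' requires the Language argument")

    # TagDelimiter is an error when OutputByTag is 'true'.
    if output_by_tag and tag_delimiter is not None:
        problems.append("TagDelimiter is not allowed when OutputByTag is 'true'")

    # 'tag' is reserved for the output column that holds the tags.
    if "tag" in accumulate:
        problems.append("Accumulate must not include a column named 'tag'")

    return problems


# "<tagging rule>" is a placeholder, not real rule syntax.
ok = check_arguments(rules=["<tagging rule>"], accumulate=("id",))
bad = check_arguments(has_rules_table=True, accumulate=("id", "tag"))
```

Returning a list of messages rather than raising on the first problem lets a caller report every violated constraint at once.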