TextTagging Arguments - Aster Analytics

Teradata Aster® Analytics Foundation User GuideUpdate 2

Product

Aster Analytics

Release Number

7.00.02

Published

September 2017

Language

English (United States)

Last Update

2018-04-17

dita:mapPath

uce1497542673292.ditamap

dita:ditavalPath

AA-notempfilter_pdf_output.ditaval

dita:id

B700-1022

lifecycle

Product Category

Software

Language

[Optional] Specifies the language of the input text:

'en': English (Default)
'zh_cn': Simplified Chinese
'zh_tw': Traditional Chinese

If UseTokenizer specifies 'true', then the function uses the value of Language to create the word tokenizer.

Rules

[Required if you do not specify a rules table, disallowed otherwise.] Specifies the tag names and tagging rules. For information about defining tagging rules, see Defining Tagging Rules.

Tokenize

[Optional] Specifies whether the function tokenizes the input text before evaluating the rules and tokenizes the text string parameter in the rule definition when parsing a rule. If you specify 'true', then you must also specify the Language argument. Default: 'false'.

OutputByTag

[Optional] Specifies whether the function outputs a tuple when a text document matches multiple tags. Default: 'false' (one tuple in the output stands for one document and the matched tags are listed in the output column tag).

TagDelimiter

[Optional]

Specifies the delimiter, a string, that separates multiple tags in the output column tag if OutputByTag has the value 'false'. Default: ',' (comma). If OutputByTag has the value 'true', specifying this argument causes an error.

Accumulate

[Optional] Specifies the names of text table columns to copy to the output table.

Do not use the name 'tag' for an accumulate_column, because the function uses that name for the output table column that contains the tags.