- Language
- [Optional] Specifies the language of the input text:
- 'en': English (Default)
- 'zh_cn': Simplified Chinese
- 'zh_tw': Traditional Chinese
If UseTokenizer specifies 'true', then the function uses the value of Language to create the word tokenizer.
- Rules
- [Required if you do not specify a rules table, disallowed otherwise.] Specifies the tag names and tagging rules. For information about defining tagging rules, see Defining Tagging Rules.
- Tokenize
- [Optional] Specifies whether the function tokenizes the input text before evaluating the rules and tokenizes the text string parameter in the rule definition when parsing a rule. If you specify 'true', then you must also specify the Language argument. Default: 'false'.
- OutputByTag
- [Optional] Specifies whether the function outputs a tuple when a text document matches multiple tags. Default: 'false' (one tuple in the output stands for one document and the matched tags are listed in the output column tag).
- TagDelimiter
- [Optional]
- Accumulate
- [Optional] Specifies the names of text table columns to copy to the output table.Do not use the name 'tag' for an accumulate_column, because the function uses that name for the output table column that contains the tags.