Output Table Schema
Column | Data Type | Description |
---|---|---|
docIdColumn | BYTEINT, SMALLINT, INTEGER, BIGINT, CHAR/VARCHAR | Unique identifier of input rows. If provided, it is always the first column in output table. |
AccumulateColumns | ANY | Columns to be copied from input to output. Default: All input columns are copied to output. |
TokenColumn | VARCHAR | Column containing individual tokens. |
frequency | INTEGER | (Optional) Value indicating the total occurrences of a token. |
locations | VARCHAR, BIGINT | (Optional) Comma separated list of values, sorted in ascending order. When ListPositions is set to false and OutputByWord is true, then the position in the input text of each single token is displayed in a separate row. However, the type for this column is BIGINT instead. |
tokens | CHAR/VARCHAR/CLOB | (Optional) When OutputByWord argument is 'false': Space separated list of tokens in a single cell. |