TD_TextParser Output - Analytics Database

Database Analytic Functions

Deployment
VantageCloud
VantageCore
Edition
Enterprise
IntelliFlex
VMware
Product
Analytics Database
Release Number
17.20
Published
June 2022
Language
English (United States)
Last Update
2024-10-04
dita:mapPath
gjn1627595495337.ditamap
dita:ditavalPath
ayr1485454803741.ditaval
dita:id
jmh1512506877710
lifecycle
latest
Product Category
Teradata Vantageā„¢

Output Table Schema

Column Data Type Description
docIdColumn BYTEINT, SMALLINT, INTEGER, BIGINT, CHAR/VARCHAR Unique identifier of input rows. If provided, it is always the first column in output table.
AccumulateColumns ANY Columns to be copied from input to output.

Default: All input columns are copied to output.

TokenColumn VARCHAR Column containing individual tokens.
frequency INTEGER (Optional) Value indicating the total occurrences of a token.
locations VARCHAR, BIGINT (Optional) Comma separated list of values, sorted in ascending order.

When ListPositions is set to false and OutputByWord is true, then the position in the input text of each single token is displayed in a separate row. However, the type for this column is BIGINT instead.

tokens CHAR/VARCHAR/CLOB (Optional) When OutputByWord argument is 'false': Space separated list of tokens in a single cell.