1.0 - 8.00 - LDA Input - Teradata Vantage

Teradata® Vantage Machine Learning Engine Analytic Function Reference

Product: Teradata Vantage
Release Number: 1.0, 8.00
Release Date: May 2019
Content Type: Programming Reference
Publication ID: B700-4003-098K
Language: English (United States)

InputTable Schema

The size of this table determines the size of the value_col column of the ModelTable. If any cell of the value_col column exceeds 64 KB, you cannot use the ModelTable with the LDAInference or LDATopicSummary function.
Column | Data Type | Description
--- | --- | ---
doc_id_column | INTEGER, SMALLINT, BIGINT, NUMERIC, VARCHAR, or VARBYTE(n) | Document identifier.
word_column | INTEGER, SMALLINT, BIGINT, or VARCHAR | Word.
count_column | INTEGER, SMALLINT, BIGINT, NUMERIC, or DOUBLE PRECISION | [Column appears only with CountColumn argument.] Number of times word appears in document.
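To illustrate the shape of the rows this schema describes, the following is a minimal Python sketch that turns raw documents into (doc_id, word, count) tuples, one row per distinct word per document. The function name `to_lda_rows`, the whitespace tokenizer, and the sample documents are illustrative assumptions and are not part of the Teradata product.

```python
from collections import Counter


def to_lda_rows(docs):
    """Turn {doc_id: text} into (doc_id, word, count) rows.

    Hypothetical helper: emits one row per distinct word per document,
    mirroring the doc_id_column / word_column / count_column layout.
    """
    rows = []
    for doc_id, text in sorted(docs.items()):
        # Naive tokenizer (assumption): lowercase, split on whitespace.
        counts = Counter(text.lower().split())
        for word, count in sorted(counts.items()):
            rows.append((doc_id, word, count))
    return rows


# Made-up sample documents, keyed by an integer document identifier.
docs = {1: "apple banana apple", 2: "banana cherry"}
rows = to_lda_rows(docs)
for row in rows:
    print(row)
```

If the count column is omitted, the same data could presumably be supplied as one row per word occurrence; that reading is an assumption, so check it against the CountColumn argument description.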
You can use TextParser output as input to the LDA function. Teradata recommends first filtering out words with very low or very high frequency. Such words degrade the results, for example by producing topics that consist of common words and are not meaningful in the topic model.
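The frequency filtering recommended above can be sketched in Python as follows. This is only an illustration under stated assumptions: it measures frequency as the number of documents a word appears in, and the function name `filter_by_doc_frequency`, the thresholds `min_docs` and `max_doc_fraction`, and the sample rows are all hypothetical. The source does not prescribe specific cutoffs or a specific frequency measure.

```python
from collections import defaultdict


def filter_by_doc_frequency(rows, min_docs, max_doc_fraction):
    """Keep (doc_id, word, count) rows whose word is neither too rare nor too common.

    Hypothetical helper: a word is kept if it appears in at least `min_docs`
    documents and in at most `max_doc_fraction` of all documents.
    """
    doc_ids = {doc_id for doc_id, _, _ in rows}
    docs_per_word = defaultdict(set)
    for doc_id, word, _ in rows:
        docs_per_word[word].add(doc_id)

    max_docs = max_doc_fraction * len(doc_ids)
    keep = {
        word
        for word, docs in docs_per_word.items()
        if min_docs <= len(docs) <= max_docs
    }
    return [row for row in rows if row[1] in keep]


# Made-up sample rows in (doc_id, word, count) form.
sample = [
    (1, "the", 5), (1, "topic", 2), (1, "zyx", 1),
    (2, "the", 4), (2, "topic", 1), (2, "model", 3),
    (3, "the", 6), (3, "model", 2),
    (4, "the", 3), (4, "topic", 1),
]

# "the" occurs in all 4 documents (too common); "zyx" in only 1 (too rare).
filtered = filter_by_doc_frequency(sample, min_docs=2, max_doc_fraction=0.75)
print(sorted({word for _, word, _ in filtered}))
```

In a database workflow, the same filter would more likely be a SQL aggregation over the TextParser output before calling the LDA function; reasonable thresholds depend on the corpus.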