Teradata Aster® Analytics Foundation User Guide Update 2

Aster Analytics
September 2017
The NaiveBayesTextClassifierPredict function requires two input tables: model_table, which is output by the NaiveBayesTextClassifierTrainer function, and input_table, which contains the test data for which to predict outcomes.

The test data must consist of document-token pairs, as described in the following table. To transform input documents into this form, pass them to the TextTokenizer or Text_Parser function.
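To make the document-token pair shape concrete, the following Python sketch flattens a few documents into one row per token occurrence. It is only an illustration of the expected row layout, not the TextTokenizer or Text_Parser function: the helper name to_doc_token_pairs, the sample documents, and the lowercase, regex-based tokenization rule are all assumptions made for this example.

```python
import re

def to_doc_token_pairs(docs):
    """Flatten {doc_id: text} into (doc_id, token) rows, one row per token occurrence.

    Hypothetical helper for illustration only; the tokenization rule
    (lowercase, runs of letters, digits, and apostrophes) is an assumption.
    """
    pairs = []
    for doc_id, text in docs.items():
        for token in re.findall(r"[a-z0-9']+", text.lower()):
            pairs.append((doc_id, token))
    return pairs

# Sample documents (made up for this sketch), keyed by document identifier.
docs = {1: "Stocks fell sharply", 2: "Rain is expected"}
rows = to_doc_token_pairs(docs)

for doc_id, token in rows:
    print(doc_id, token)
```

Each resulting row corresponds to one row of input_table: the first element plays the role of doc_id_column and the second the role of token_column.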

NaiveBayesTextClassifierPredict Input Table Schema
Column Name | Data Type | Description
doc_id_column | CHARACTER, VARCHAR, text, INTEGER, or SMALLINT | Contains the identifiers of the documents that contain the test tokens to classify. A document identifier can span multiple columns; therefore, the table can have more than one such column.
token_column | CHARACTER, VARCHAR, or text | Contains the test tokens to classify.