The NaiveBayesTextClassifierPredict function has two required input tables: model_table, which is output by the function NaiveBayesTextClassifierTrainer, and input_table, which contains the test data for which to predict outcomes.
The test data must be in the form of document-token pairs, with the columns described in the following table. To transform the input documents into this form, pass them to the function TextTokenizer or Text_Parser.
Column Name | Data Type | Description |
---|---|---|
doc_id_column | CHARACTER, VARCHAR, text, INTEGER, or SMALLINT | Contains the identifiers of the documents that contain the tokens. A document identifier can span multiple columns; therefore, the table can have more than one such column. |
token_column | CHARACTER, VARCHAR, or text | Contains the test tokens to classify. |
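
To illustrate the document-token pair layout, the following minimal Python sketch flattens a small set of documents into (doc_id, token) rows. It is not the TextTokenizer or Text_Parser function; the helper name `to_doc_token_pairs`, the sample documents, and the lowercase alphanumeric tokenization rule are assumptions made only for this illustration.

```python
import re


def to_doc_token_pairs(docs):
    """Flatten {doc_id: text} into (doc_id, token) rows, one row per token occurrence.

    Hypothetical helper: the tokenization rule (lowercase runs of letters and
    digits) is an assumption, not the behavior of TextTokenizer or Text_Parser.
    """
    rows = []
    for doc_id, text in docs.items():
        for token in re.findall(r"[a-z0-9]+", text.lower()):
            rows.append((doc_id, token))
    return rows


# Hypothetical test documents keyed by a single-column document identifier.
docs = {1: "Stocks fell sharply", 2: "Rain expected today"}
pairs = to_doc_token_pairs(docs)

# Each printed line corresponds to one row of input_table:
# doc_id_column value, then token_column value.
for doc_id, token in pairs:
    print(doc_id, token)
```

Each resulting row maps to one row of input_table: the first element is the doc_id_column value and the second is the token_column value. A document with several tokens therefore appears in several rows.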