NaiveBayesTextClassifierTrainer Input

NaiveBayesTextClassifierTrainer Input - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product

Teradata Vantage

Release Number

8.00

1.0

Published

May 2019

Language

English (United States)

Last Update

2019-11-22

dita:mapPath

blj1506016597986.ditamap

dita:ditavalPath

blj1506016597986.ditaval

dita:id

B700-4003

lifecycle

Product Category

Teradata Vantage™

Table	Description
tokens	Contains classified training tokens. Usually output by a tokenizing function, such as TextTokenizer or TextParser.
[Optional] categories	Contains prediction categories to use in model, which you can also specify with Categories argument. If you omit both this table and Categories argument, function uses all categories specified by DocCategoryColumn argument.
[Optional] stop_words	Contains stop words (a, an, the, and so on). If you omit this table, you must specify stop words with StopWords argument.

Column	Data Type	Description
doc_id_column	CHARACTER, VARCHAR, INTEGER, or SMALLINT	[Column appears once for each specified doc_id_column.] Identifier of document that contains classified training tokens.
token_column	CHARACTER or VARCHAR	Classified training token.
doc_category_column	CHARACTER or VARCHAR	Category of document. Partition table by this column.

Column	Data Type	Description
category_column	CHARACTER or VARCHAR	Prediction category.

Column	Data Type	Description
stop_words_column	CHARACTER or VARCHAR	Stop word.