NaiveBayesTextClassifierPredict Input - Teradata® Database

Database Analytic Functions

Product
Teradata® Database
Release Number
17.10
Published
July 2021
Language
English (United States)
Last Update
2021-07-28
dita:mapPath
Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/wnd1589838592459.ditamap
dita:ditavalPath
Teradata_Vantage™___Advanced_SQL_Engine_Analytic_Functions.withLogo_upload_July2021/ayr1485454803741.ditaval
dita:id
B035-1206
lifecycle
previous
Product Category
Teradata Vantage™
Table Description
PredictorValues Contains test data, for which to predict outcomes, in document-token pairs. To transform input document into this form, input it to ML Engine function TextTokenizer or TextParser.

TextTokenizer and TextParser have language-processing limitations that might limit support for Unicode input data (see Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003).

Model Model output by ML Engine NaiveBayesTextClassifierTrainer2 function. For schema, see Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003.

PredictorValues Schema

Column Data Type Description
doc_id_column CHARACTER, VARCHAR, INTEGER, or SMALLINT Identifier of document that contains classified testing tokens.
token_column CHARACTER or VARCHAR Testing token.
accumulate_column Any Column to copy to output table.

Model Schema

For CHARACTER and VARCHAR columns, CHARACTER SET must be either UNICODE or LATIN.

Column Data Type Description
token CHARACTER or VARCHAR Classified training token.
category CHARACTER or VARCHAR Prediction category for token.
prob DOUBLE PRECISION Probability that token is in category.