Table | Description |
---|---|
PredictorValues | Contains test data, for which to predict outcomes, in document-token pairs. To transform input document into this form, input it to ML Engine function TextTokenizer or TextParser. TextTokenizer and TextParser have language-processing limitations that might limit support for Unicode input data (see Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003). |
Model | Model output by ML Engine NaiveBayesTextClassifierTrainer2 function. For schema, see Teradata Vantage™ Machine Learning Engine Analytic Function Reference, B700-4003. |
PredictorValues Schema
Column | Data Type | Description |
---|---|---|
doc_id_column | CHARACTER, VARCHAR, INTEGER, or SMALLINT | Identifier of document that contains classified testing tokens. |
token_column | CHARACTER or VARCHAR | Testing token. |
accumulate_column | Any | Column to copy to output table. |
Model Schema
For CHARACTER and VARCHAR columns, CHARACTER SET must be either UNICODE or LATIN.
Column | Data Type | Description |
---|---|---|
token | CHARACTER or VARCHAR | Classified training token. |
category | CHARACTER or VARCHAR | Prediction category for token. |
prob | DOUBLE PRECISION | Probability that token is in category. |