NaiveBayesTextClassifierPredict_MLE Syntax Elements - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
9.02
9.01
2.0
1.3
Published
February 2022
Language
English (United States)
Last Update
2022-02-10
dita:id
B700-4003
Product Category
Teradata Vantage™
InputTextColumn
InputTokenColumn
Specify the name of the InputTable column that contains the tokens.
ModelType
[Optional] Specify the model type of the text classifier.
Default: 'Multinomial'
DocIDColumns
[Optional] Specify the names of the InputTable columns that contain the document identifier.
ModelTokenColumn
[Optional] Specify the name of the Model table column that contains the tokens.
Default: First column of Model table
ModelCategoryColumn
[Optional] Specify the name of the Model table column that contains the prediction categories.
Default: Second column of Model table
ModelProbColumn
[Optional] Specify the name of the Model table column that contains the token counts.
Default: Third column of Model table
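The three Model column arguments above each fall back to a column position when omitted. As a minimal sketch of that fallback rule (the helper name resolve_model_columns and its parameters are hypothetical, used only for illustration, and are not part of the Teradata function):

```python
def resolve_model_columns(model_columns, token_col=None,
                          category_col=None, prob_col=None):
    """Return (token, category, prob) column names for a Model table.

    Hypothetical helper: any argument left as None falls back to the
    documented default, that is, the first, second, or third column.
    """
    if len(model_columns) < 3:
        raise ValueError("Model table needs at least three columns")
    return (
        token_col or model_columns[0],     # ModelTokenColumn default
        category_col or model_columns[1],  # ModelCategoryColumn default
        prob_col or model_columns[2],      # ModelProbColumn default
    )
```

For example, passing only category_col="label" for a table with columns a, b, c would resolve to a, label, c.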
IsTokenized
[Optional] Specify whether the input data is already tokenized. With IsTokenized ('true'), the function does not tokenize the input data. Specifying IsTokenized ('true') with untokenized input data may result in ambiguous or meaningless predictions.
Default: 'false'
TopK
[Disallowed with Responses, otherwise optional.] Specify the number of class labels (num_of_top_k_predictions) to appear in the output table. For each observation, the output table contains the num_of_top_k_predictions most likely classes. To see the probability of each class, use OutputProb ('true').
Default: All prediction categories
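To illustrate what selecting the most likely classes means, the following sketch ranks per-class probabilities for a single observation and keeps the first k. It is an illustration only, not the function's implementation; the name top_k_classes and the dictionary input are assumptions made for the example.

```python
def top_k_classes(class_probs, k=None):
    """Return (label, probability) pairs, most likely class first.

    class_probs is an assumed mapping of label -> probability for one
    observation. k=None mirrors the documented default of returning
    every prediction category.
    """
    if k is not None and k < 1:
        raise ValueError("k must be a positive integer")
    # Sort labels by descending probability.
    ranked = sorted(class_probs.items(), key=lambda kv: kv[1], reverse=True)
    return ranked if k is None else ranked[:k]
```

With probabilities of 0.7, 0.2, and 0.1 for three classes and k set to 2, only the two classes with probabilities 0.7 and 0.2 would be kept.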
OutputProb
[Required to be 'true' with Responses, optional otherwise.] Specify whether to output the calculated probability for each observation.
Default: 'false'
Responses
[Disallowed with TopK, otherwise optional.] Specify the labels for which to output probabilities.
Default behavior: Output only the probability of the predicted class.
ConvertToLowerCase
[Optional with IsTokenized ('false'), disallowed otherwise.] Specify whether to convert all letters in the input text to lowercase.
Default: 'false'
StemTokens
[Optional with IsTokenized ('false'), disallowed otherwise.] Specify whether to stem the tokens as part of text tokenization.
Default: 'true'
Accumulate
Specify the names of the PredictorValues table columns to copy to the output table.
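Several of the syntax elements above constrain one another: TopK and Responses are mutually exclusive, Responses requires OutputProb ('true'), and ConvertToLowerCase and StemTokens are disallowed with IsTokenized ('true'). As a rough sketch of a pre-flight check for those rules (the function validate_args and its keyword names are hypothetical, and None is assumed to mean "argument not specified"):

```python
def validate_args(is_tokenized=False, top_k=None, output_prob=False,
                  responses=None, convert_to_lower_case=None,
                  stem_tokens=None):
    """Return a list of constraint violations; empty if the combination is allowed.

    Hypothetical checker for the documented argument rules. None means
    the corresponding syntax element was not specified.
    """
    errors = []
    if top_k is not None and responses is not None:
        errors.append("TopK and Responses cannot be used together")
    if responses is not None and not output_prob:
        errors.append("Responses requires OutputProb ('true')")
    if is_tokenized:
        if convert_to_lower_case is not None:
            errors.append("ConvertToLowerCase is disallowed with IsTokenized ('true')")
        if stem_tokens is not None:
            errors.append("StemTokens is disallowed with IsTokenized ('true')")
    return errors
```

Returning a list rather than raising on the first problem lets a caller report every conflicting argument at once.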