Text Analytic Functions - Teradata Vantage

Teradata® VantageCloud Lake

Deployment
VantageCloud
Edition
Lake
Product
Teradata Vantage
Published
January 2023
ft:locale
en-US
ft:lastEdition
2024-12-11
dita:mapPath
phg1621910019905.ditamap
dita:ditavalPath
pny1626732985837.ditaval
dita:id
phg1621910019905
TD_Ngramsplitter
Tokenizes (splits) an input stream and emits n multigrams, based on specified delimiter and reset parameters. Useful for sentiment analysis, topic identification, and document classification.
TD_NaiveBayesTextClassifierPredict
Uses the model output by TD_NaiveBayesTextClassifierTrainer function to analyze the input data and make predictions.
TD_NaiveBayesTextClassifierTrainer
Calculates the conditional probabilities for token-category pairs, the prior probabilities, and the missing token probabilities for all categories.
TD_SentimentExtractor
Uses a dictionary model to extract the sentiment (positive, negative, or neutral) of each input document or sentence.
TD_TextParser
Tokenizes an input stream of words and creates a row for each word in the output table.
TD_TFIDF
Takes any document set and outputs the Term Frequency, Inverse Document Frequency, and Term Frequency - Inverse Document Frequency scores for each term.
TD_WordEmbeddings
Uses training and prediction to determine the similarity between words and phrases.