TFIDF Output - Teradata Vantage

Machine Learning Engine Analytic Function Reference

Product
Teradata Vantage
Release Number
9.02
9.01
2.0
1.3
Published
February 2022
Language
English (United States)
Last Update
2022-02-10
dita:mapPath
rnn1580259159235.ditamap
dita:ditavalPath
ybt1582220416951.ditaval
dita:id
B700-4003
lifecycle
previous
Product Category
Teradata Vantageā„¢

Output Schema

Column Data Type Description
docid Any Document identifier of document d.
term VARCHAR Term t.
tf DOUBLE PRECISION Term frequency of term t in document d, calculated as specified by Formula syntax element.
idf DOUBLE PRECISION Inverse document frequency of term t in document d, calculated by this formula:

IDF(t) = log (doccount / doccount (t))

where doccount is the number of documents in the document set and doccount (t) is the number of documents that contain the term t.

tf_idf DOUBLE PRECISION TFIDF score of term t in document d, calculated by this formula:

TFIDF(t, d) = TF(t, d) * IDF(t)