7.00.02 - Output - Aster Analytics

Teradata Aster® Analytics Foundation User Guide Update 2

Product: Aster Analytics
Release Number: 7.00.02
Published: September 2017
Content Type: Programming Reference, User Guide
Publication ID: B700-1022-700K
Language: English (United States)
Last Update: 2018-04-17

TF_IDF Output Schema

| Column Name | Data Type | Description |
|---|---|---|
| docid | Any | Document identifier of document d. |
| term | VARCHAR | Term t. |
| tf | DOUBLE PRECISION | Term frequency of term t in document d, calculated as specified by the Formula argument. |
| idf | DOUBLE PRECISION | Inverse document frequency of term t across the document set, calculated by this formula: IDF(t) = log (doccount / doccount(t)), where doccount is the number of documents in the document set and doccount(t) is the number of documents that contain the term t. |
| tf_idf | DOUBLE PRECISION | TF_IDF score of term t in document d, calculated by this formula: TF_IDF(t, d) = TF(t, d) * IDF(t) |
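
The idf and tf_idf formulas can be checked by hand on a small document set. The following is a minimal Python sketch, not the Aster implementation: the function name `tf_idf_rows` and the sample documents are hypothetical, tf is assumed to be the normalized count (occurrences of t in d divided by the number of terms in d), which is only one of the options the Formula argument may select, and log is assumed to be the natural logarithm, which this page does not state.

```python
import math
from collections import Counter


def tf_idf_rows(docs):
    """Return (docid, term, tf, idf, tf_idf) tuples, one per term per document.

    docs maps a document identifier to its list of terms.
    """
    doccount = len(docs)

    # doccount(t): number of documents that contain term t.
    doccount_t = Counter()
    for terms in docs.values():
        doccount_t.update(set(terms))

    rows = []
    for docid, terms in docs.items():
        counts = Counter(terms)
        total = len(terms)
        for term, n in sorted(counts.items()):
            tf = n / total  # assumption: normalized term frequency
            idf = math.log(doccount / doccount_t[term])  # assumption: natural log
            rows.append((docid, term, tf, idf, tf * idf))
    return rows


# Hypothetical two-document set.
docs = {
    1: ["aster", "analytics", "aster"],
    2: ["analytics", "guide"],
}
rows = tf_idf_rows(docs)
for row in rows:
    print(row)
```

In this sample, "analytics" appears in both documents, so doccount / doccount(t) is 1 and both its idf and tf_idf are 0, while "aster" and "guide" each appear in one of the two documents and get idf = log(2).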