Column Name | Data Type | Description |
---|---|---|
docid | Any | Document identifier of document d. |
term | VARCHAR | Term t. |
tf | DOUBLE PRECISION | Term frequency of term t in document d, calculated as specified by the Formula argument. |
idf | DOUBLE PRECISION | Inverse document frequency of term t in document d, calculated by this formula: IDF(t) = log (doccount / doccount(t)) where doccount is the number of documents in the document set and doccount(t) is the number of documents that contain the term t. |
tf_idf | DOUBLE PRECISION | TF_IDF score of of term t in document d, calculated by this formula: TF_IDF(t, d) = TF(t, d) * IDF(t) |