BOOL |
Boolean frequency: tf(t,d)= 1 if t occurs in d; otherwise tf(t,d)= 0
|
COUNT |
Raw frequency: tf(t,d)= f(t,d)
where f(t,d) is the number of times t occurs in d (that is, the raw frequency, rf).
|
NORMAL |
Normalized frequency: tf(t,d)= f(t,d) / sum {f(w,d) : w ∈ d}
This value is rf divided by the total number of term occurrences in the document.
|
LOG |
Logarithmically-scaled frequency: tf(t,d)= 1 + log(f(t,d))
This value is 1 plus the natural logarithm of rf.
|
AUGMENT |
Augmented frequency, which prevents bias towards longer documents: tf(t,d)= 0.5 + (0.5 × f(t,d) / max {f(w,d) : w ∈ d})
This value is 0.5 plus half the ratio of rf to the maximum raw frequency of any term in the document.
|
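The five formulas above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the product's implementation: the function name `term_frequency`, the representation of a document as a list of tokens, and the choice to return 0 for LOG when the term is absent (where log(0) is undefined) are all assumptions made for the example.

```python
import math
from collections import Counter

def term_frequency(term, doc_terms, scheme="COUNT"):
    """Return tf(t, d) for one term under the named scheme.

    doc_terms is a list of tokens (an assumed document representation).
    """
    counts = Counter(doc_terms)
    f = counts[term]  # raw frequency rf; 0 if the term is absent

    if scheme == "BOOL":
        return 1.0 if f > 0 else 0.0
    if scheme == "COUNT":
        return float(f)
    if scheme == "NORMAL":
        # rf divided by the total number of term occurrences in the document
        return f / len(doc_terms) if doc_terms else 0.0
    if scheme == "LOG":
        # Assumption: tf is 0 for an absent term, since log(0) is undefined
        return 1.0 + math.log(f) if f > 0 else 0.0
    if scheme == "AUGMENT":
        max_f = max(counts.values(), default=0)
        # Assumption: an empty document yields 0 rather than dividing by zero
        return 0.5 + 0.5 * f / max_f if max_f else 0.0
    raise ValueError(f"unknown scheme: {scheme}")

doc = ["a", "b", "a", "c"]
for scheme in ("BOOL", "COUNT", "NORMAL", "LOG", "AUGMENT"):
    print(scheme, term_frequency("a", doc, scheme))
```

Note that, applied literally, the AUGMENT formula gives 0.5 (not 0) to a term that does not occur in a non-empty document; the sketch keeps that behavior.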