[nlp] Introduction to the TF-IDF algorithm

(1) TF stands for Term Frequency

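Term frequency measures how often a word occurs in a document. It is usually computed as the number of times the word appears in the document divided by the total number of words in that document, so it can be read as the empirical probability of drawing that word from the document.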
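As a rough illustration of the definition above, the following sketch computes term frequency as a simple ratio. The function name `term_frequency`, the whitespace tokenization, and the sample sentence are assumptions made for this example and are not part of the original post.

```python
from collections import Counter


def term_frequency(term, document_tokens):
    """Return count(term) / total tokens for one document (hypothetical helper)."""
    if not document_tokens:
        # Assumption: an empty document gives every term a frequency of 0.
        return 0.0
    counts = Counter(document_tokens)
    return counts[term] / len(document_tokens)


# Naive whitespace tokenization, used here only to keep the example short.
doc = "the cat sat on the mat".split()

tf_the = term_frequency("the", doc)  # "the" appears 2 times among 6 tokens
tf_cat = term_frequency("cat", doc)  # "cat" appears 1 time among 6 tokens
print(tf_the, tf_cat)
```

Dividing by the document length keeps long documents from getting larger scores merely because they contain more words.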

(2) IDF stands for Inverse Document Frequency

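The fewer documents in the corpus that contain a term, the larger its IDF. Rare terms therefore receive a higher weight than terms that appear in almost every document.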
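The post does not give a formula for IDF, so the sketch below assumes the common unsmoothed textbook form, log(N / df), where N is the number of documents and df is the number of documents containing the term. The function name `inverse_document_frequency`, the toy corpus, and the handling of unseen terms are all assumptions for illustration; real libraries often apply smoothing and will produce different values.

```python
import math


def inverse_document_frequency(term, corpus):
    """Return log(N / df) for a term over a list of tokenized documents.

    Assumed unsmoothed form; not necessarily the variant the original post had in mind.
    """
    n_documents = len(corpus)
    # df: number of documents that contain the term at least once.
    document_frequency = sum(1 for tokens in corpus if term in tokens)
    if document_frequency == 0:
        # Assumption: a term that appears nowhere gets 0.0 instead of a division error.
        return 0.0
    return math.log(n_documents / document_frequency)


# Toy corpus of three already-tokenized documents.
corpus = [
    ["the", "cat", "sat"],
    ["the", "dog", "barked"],
    ["the", "bird", "sang"],
]

idf_the = inverse_document_frequency("the", corpus)  # in all 3 documents: log(3/3)
idf_cat = inverse_document_frequency("cat", corpus)  # in 1 of 3 documents: log(3/1)
print(idf_the, idf_cat)
```

In this toy corpus "the" occurs in every document and gets an IDF of zero, while "cat" occurs in only one and gets a larger value, which matches the statement above.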

Origin blog.csdn.net/Trance95/article/details/132004950