vectorizer = CountVectorizer () # construct a computing term frequency ( TF ) stuff, of course, there is a lack of these can be done
transformer = TfidfTransformer () # build a computing TF-IDF stuff
TfidfTransformer + CountVectorizer = TfidfVectorizer
Python in TfidfVectorizer parameter parsing: https://blog.csdn.net/feng_zhiyu/article/details/81952697
TF-IDF algorithm introduction and realization : https://blog.csdn.net/asialee_bird/article/details/81486700