[DataSet] Artificial intelligence data sets commonly used in natural language processing

Common artificial intelligence datasets for natural language processing

Dataset category Common Datasets
Text Classification Dataset Toutiao Chinese news (short text) classification data set; Tsinghua news classification corpus data set dmsc_v2 data set; ChnSentiCorp htl all data set
Font Recognition Dataset boson dataset; MSRA Microsoft Asia Research dataset; SIGHAN Bakeoff 2005 dataset
Search for matching datasets query-title semantic matching dataset; SogouE dataset; ez douban dataset; yfdianping dataset
Recommender System Dataset MovieLens dataset; Jester dataset; Book-Crossings dataset; Lastfm dataset; Wikipedia dataset; OpenStreetMap dataset

おすすめ

転載: blog.csdn.net/qq_44824148/article/details/129859743