NLP practice (news text classification): understanding the competition task and approaches

2021-01-27 08:25:46

Understanding the task

Getting the data

This is an entry-level Tianchi NLP competition, and the procedure is the usual one: register first, then download the data.
Pay attention to the competition's rules and evaluation standard.

Approaches to the task

Because the competition data is anonymized, we cannot use operations such as word segmentation to extract keywords for a simple prediction. What we can use is a classifier built on features extracted from the text, or a deep learning classifier. Broadly, there are the following ideas:

Idea 1: TF-IDF + machine learning classifier: use TF-IDF to extract features from the text directly, then classify with a machine learning classifier such as SVM, LR, or XGBoost.
Idea 2: FastText: FastText is an entry-level word-vector approach. With the FastText tool provided by Facebook, you can build a classifier quickly.
Idea 3: Word2Vec + deep learning classifier: Word2Vec is a more advanced word vector, and classification is done by building a deep learning classifier on top of it. The network can be TextCNN, TextRNN, or BiLSTM.
Idea 4: BERT word vectors: BERT is the top-of-the-line word representation, with powerful modeling and learning capability.
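To make Idea 1 more concrete, here is a minimal sketch in plain Python. It rests on several assumptions that are not stated in this post: that each anonymized document is a string of space-separated integer IDs, that the token IDs and labels below are made-up toy values, and that a nearest-centroid rule is an acceptable stand-in for SVM, LR, or XGBoost. The function names (fit_idf, tfidf, fit_centroids, predict) are hypothetical. In practice one would more likely use a library such as scikit-learn rather than hand-rolled code.

```python
import math
from collections import Counter, defaultdict


def fit_idf(docs):
    """Compute a smoothed IDF weight for every token seen in the training docs."""
    n_docs = len(docs)
    # Document frequency: in how many documents each token occurs.
    df = Counter(tok for doc in docs for tok in set(doc.split()))
    return {tok: math.log((1 + n_docs) / (1 + n)) + 1.0 for tok, n in df.items()}


def tfidf(doc, idf):
    """Return an L2-normalized sparse TF-IDF vector (dict) for one document."""
    # Tokens never seen during fitting are ignored.
    counts = Counter(tok for tok in doc.split() if tok in idf)
    vec = {tok: c * idf[tok] for tok, c in counts.items()}
    norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
    return {tok: w / norm for tok, w in vec.items()}


def fit_centroids(docs, labels, idf):
    """Average the TF-IDF vectors of each class (toy stand-in for SVM/LR/XGBoost)."""
    sums = defaultdict(lambda: defaultdict(float))
    per_class = Counter(labels)
    for doc, label in zip(docs, labels):
        for tok, w in tfidf(doc, idf).items():
            sums[label][tok] += w
    return {
        label: {tok: w / per_class[label] for tok, w in vec.items()}
        for label, vec in sums.items()
    }


def predict(doc, idf, centroids):
    """Pick the class whose centroid has the largest dot product with the doc."""
    vec = tfidf(doc, idf)

    def score(centroid):
        return sum(w * centroid.get(tok, 0.0) for tok, w in vec.items())

    return max(centroids, key=lambda label: score(centroids[label]))


# Toy anonymized data: the token IDs and labels are invented for illustration.
train_docs = [
    "3750 648 900 3750",
    "648 3750 900 12",
    "57 4464 2465 57",
    "2465 57 4464 99",
]
train_labels = [0, 0, 1, 1]

idf = fit_idf(train_docs)
centroids = fit_centroids(train_docs, train_labels, idf)
pred_a = predict("900 3750 648", idf, centroids)
pred_b = predict("4464 57 57", idf, centroids)
print(pred_a, pred_b)
```

The point of the sketch is that no word segmentation is needed: the anonymized IDs are treated as opaque tokens, and TF-IDF weighting works on them unchanged.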

We will implement them one by one in later posts.

I also hope I can stick with this competition to the end.

Origin blog.csdn.net/weixin_45696161/article/details/107475518
