Summary of NLP knowledge points
Basic knowledge
Cross entropy, KL divergence, maximum likelihood estimation and maximum posterior estimation
Word vector
Detailed explanation of word vectors: from word2vec, glove, ELMo to BERT
Cross entropy, KL divergence, maximum likelihood estimation and maximum posterior estimation
Detailed explanation of word vectors: from word2vec, glove, ELMo to BERT