[cs224n] Lecture 2 – Word Vectors and Word Senses

Posted 2019-04-03 13:41:03 (category: Other)

Copyright notice: This is the blogger's original article and may not be reproduced without the blogger's permission. https://blog.csdn.net/weixin_37993251/article/details/88970045

1. Review: Main idea of word2vec

Word2vec parameters and computations

Word2vec maximizes objective function by putting similar words nearby in space
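
For reference, the skip-gram objective can be written out as below. This is the standard word2vec formulation, with assumed notation: T is the corpus length, m the window size, v_c the center-word vector, u_o the outside-word vector, and V the vocabulary. Maximizing the likelihood of the observed context words is equivalent to minimizing the average negative log-likelihood J.

```latex
\begin{aligned}
L(\theta) &= \prod_{t=1}^{T} \; \prod_{\substack{-m \le j \le m \\ j \ne 0}} P(w_{t+j} \mid w_t; \theta) \\
J(\theta) &= -\frac{1}{T} \sum_{t=1}^{T} \; \sum_{\substack{-m \le j \le m \\ j \ne 0}} \log P(w_{t+j} \mid w_t; \theta) \\
P(o \mid c) &= \frac{\exp(u_o^{\top} v_c)}{\sum_{w \in V} \exp(u_w^{\top} v_c)}
\end{aligned}
```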

2. Optimization: Gradient Descent

Gradient Descent
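
A minimal sketch of the update rule theta <- theta - lr * grad(theta), applied to a toy one-dimensional objective. The function name, the learning rate, and the quadratic objective are assumptions chosen for illustration, not part of the lecture.

```python
def gradient_descent(grad, theta, lr=0.1, steps=100):
    """Repeatedly step against the gradient: theta <- theta - lr * grad(theta)."""
    for _ in range(steps):
        theta = theta - lr * grad(theta)
    return theta


# Toy objective J(theta) = (theta - 3)^2, whose gradient is 2 * (theta - 3).
theta_star = gradient_descent(lambda t: 2.0 * (t - 3.0), theta=0.0)
print(theta_star)  # approaches the minimizer theta = 3
```

Each step here uses the exact gradient of the full objective, which is what becomes too expensive when the objective sums over every window in a large corpus.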

Stochastic Gradient Descent
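
The idea can be sketched on a toy regression problem: instead of computing the gradient over all examples, each step samples one example and updates on its gradient alone. The data, learning rate, and function name below are hypothetical and only meant to show the sampling pattern.

```python
import random


def sgd_fit_slope(data, lr=0.01, steps=2000, seed=0):
    """Fit w in y = w * x by stochastic gradient descent on squared error."""
    rng = random.Random(seed)
    w = 0.0
    for _ in range(steps):
        x, y = rng.choice(data)        # one sampled example, not the full dataset
        grad = 2.0 * (w * x - y) * x   # d/dw of (w * x - y)^2
        w -= lr * grad
    return w


# Noise-free toy data generated from y = 2x.
data = [(x, 2.0 * x) for x in (1.0, 2.0, 3.0, 4.0)]
w_hat = sgd_fit_slope(data)
print(w_hat)  # close to 2.0
```

In word2vec the "example" is a single window (or a small batch of windows) sampled from the corpus.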

Stochastic gradients with word vectors!
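
Because a single window contains only a handful of words, the gradient for that window is non-zero for only a few rows of the embedding matrix. A rough sketch of updating just those rows, using a plain dict as a stand-in for the embedding table (the words, vectors, and gradient values are made up for illustration):

```python
def sparse_sgd_update(embeddings, row_grads, lr=0.05):
    """Update only the rows that received a gradient in the current window.

    embeddings: dict mapping word -> list of floats (modified in place)
    row_grads:  dict mapping word -> gradient list, only for words in the window
    """
    for word, grad in row_grads.items():
        vec = embeddings[word]
        for i, g in enumerate(grad):
            vec[i] -= lr * g


embeddings = {"the": [0.1, 0.2], "cat": [0.3, -0.1], "sat": [0.0, 0.5]}
window_grads = {"cat": [1.0, -2.0]}  # hypothetical gradient for one window
sparse_sgd_update(embeddings, window_grads)
print(embeddings["cat"])  # only this row has changed
```

With a dense matrix the same effect is usually obtained by indexing the affected rows rather than adding a mostly-zero gradient to the whole matrix.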

1b. Word2vec: More details

So far, we have looked at two main classes of methods for finding word embeddings. The first class is count-based and relies on matrix factorization (e.g., LSA, HAL). These methods leverage global statistical information effectively, but they are used primarily to capture word similarities and do poorly on tasks such as word analogy, which indicates a sub-optimal vector space structure. The second class is shallow and window-based (e.g., the skip-gram and CBOW models); these methods learn word embeddings by making predictions within local context windows. They can capture complex linguistic patterns beyond word similarity, but they fail to make use of global co-occurrence statistics.

The skip-gram model with negative sampling (HW2)
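
One common form of the negative-sampling loss for a single (center, outside) pair is J = -log σ(u_o · v_c) - Σ_k log σ(-u_k · v_c), where the u_k are vectors of K sampled "negative" words. The sketch below evaluates that expression in plain Python; the function names and the toy vectors are assumptions, and how negatives are sampled is left out.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def neg_sampling_loss(v_center, u_outside, u_negatives):
    """Loss for one center/outside pair with K sampled negative words."""
    # Reward a high score for the true outside word ...
    loss = -math.log(sigmoid(dot(u_outside, v_center)))
    # ... and a low score for each sampled negative word.
    for u_k in u_negatives:
        loss -= math.log(sigmoid(-dot(u_k, v_center)))
    return loss


# Toy vectors: the outside word is aligned with the center, the negative is opposed.
loss = neg_sampling_loss([1.0, 0.0], [2.0, 0.0], [[-2.0, 0.0]])
print(loss)
```

Compared with the full softmax, this replaces the sum over the whole vocabulary with K + 1 sigmoid terms per training pair.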

In comparison, GloVe is a weighted least squares model trained on global word-word co-occurrence counts, and thus makes efficient use of corpus statistics. The model produces a word vector space with meaningful sub-structure. It shows state-of-the-art performance on the word analogy task, and outperforms other current methods on several word similarity tasks.
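
The weighted least squares objective can be sketched as J = Σ_{i,j} f(X_ij) (w_i · w̃_j + b_i + b̃_j - log X_ij)^2, with the weighting function f(x) = (x / x_max)^α for x < x_max and 1 otherwise. The defaults x_max = 100 and α = 0.75 follow the GloVe paper; the function names and the dict-based data layout are assumptions made for this example.

```python
import math


def glove_weight(x, x_max=100.0, alpha=0.75):
    """Down-weight rare pairs and cap the influence of very frequent ones."""
    return (x / x_max) ** alpha if x < x_max else 1.0


def glove_loss(cooc, w, w_tilde, b, b_tilde):
    """Weighted least squares loss over the non-zero co-occurrence counts.

    cooc: dict mapping (i, j) -> co-occurrence count X_ij > 0
    w, w_tilde: lists of word and context vectors; b, b_tilde: lists of biases
    """
    total = 0.0
    for (i, j), x in cooc.items():
        pred = sum(a * c for a, c in zip(w[i], w_tilde[j])) + b[i] + b_tilde[j]
        total += glove_weight(x) * (pred - math.log(x)) ** 2
    return total


zeros = [[0.0], [0.0]]
print(glove_loss({(0, 1): 100.0}, zeros, zeros, [0.0, 0.0], [0.0, 0.0]))
```

Only pairs with a non-zero count enter the sum, which is why training cost scales with the number of observed pairs rather than with the square of the vocabulary size.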

3. But why not capture co-occurrence counts directly?

Example: Window based co-occurrence matrix

Window based co-occurrence matrix
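
A small sketch of how such a matrix can be counted, assuming whitespace tokenization, a symmetric window of size 1, and a three-sentence toy corpus chosen for illustration. Counts are stored sparsely in a Counter keyed by (word, neighbor) pairs.

```python
from collections import Counter


def cooccurrence_counts(sentences, window=1):
    """Count how often each word appears within `window` positions of another."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[(word, tokens[j])] += 1
    return counts


corpus = ["I like deep learning .", "I like NLP .", "I enjoy flying ."]
counts = cooccurrence_counts(corpus)
print(counts[("I", "like")], counts[("I", "enjoy")], counts[("I", "deep")])
```

Because the window is symmetric, the resulting matrix is symmetric as well: the count for (a, b) always equals the count for (b, a).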

Problems with simple co-occurrence vectors

Solution: Low dimensional vectors

Method 1: Dimensionality Reduction on X (HW1)

Simple SVD word vectors in Python
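
A minimal sketch of this step, assuming NumPy and a hand-built 8x8 window-1 co-occurrence matrix for a three-sentence toy corpus ("I like deep learning .", "I like NLP .", "I enjoy flying ."). The matrix, the choice of k = 2, and the scaling by singular values are illustrative assumptions.

```python
import numpy as np

words = ["I", "like", "enjoy", "deep", "learning", "NLP", "flying", "."]

# Symmetric co-occurrence counts (window size 1); rows and columns follow `words`.
X = np.array([
    [0, 2, 1, 0, 0, 0, 0, 0],
    [2, 0, 0, 1, 0, 1, 0, 0],
    [1, 0, 0, 0, 0, 0, 1, 0],
    [0, 1, 0, 0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0, 0, 0, 1],
    [0, 1, 0, 0, 0, 0, 0, 1],
    [0, 0, 1, 0, 0, 0, 0, 1],
    [0, 0, 0, 0, 1, 1, 1, 0],
], dtype=float)

# X = U @ diag(s) @ Vt, with singular values in s sorted in descending order.
U, s, Vt = np.linalg.svd(X, full_matrices=False)

# Keep the top-k singular directions as low-dimensional word vectors.
k = 2
word_vectors = U[:, :k] * s[:k]

for word, vec in zip(words, word_vectors):
    print(word, vec)
```

Plotting the first two columns gives a 2-D picture of the vocabulary; on a corpus this small the layout is not meaningful, but the same calls apply to a larger count matrix.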

Hacks to X (several used in Rohde et al. 2005)

Interesting syntactic patterns emerge in the vectors

Count based vs. direct prediction

How to evaluate word vectors?

Intrinsic word vector evaluation

GloVe Visualizations

GloVe Visualizations: Company - CEO

GloVe Visualizations: Superlatives

Details of intrinsic word vector evaluation

Analogy evaluation and hyperparameters
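
The analogy task "a is to b as c is to ?" is usually scored by finding the word whose vector has the highest cosine similarity to x_b - x_a + x_c, with the three input words excluded from the search. A small sketch with hand-made toy vectors (the vectors and function names are hypothetical, not trained embeddings):

```python
import math


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def solve_analogy(vectors, a, b, c):
    """Return the word d that best completes 'a : b :: c : d'."""
    target = [xb - xa + xc
              for xa, xb, xc in zip(vectors[a], vectors[b], vectors[c])]
    # Exclude the query words themselves, as is standard for this evaluation.
    candidates = (w for w in vectors if w not in (a, b, c))
    return max(candidates, key=lambda w: cosine(vectors[w], target))


toy_vectors = {
    "man":   [1.0, 0.0, 0.2],
    "woman": [1.0, 1.0, 0.2],
    "king":  [1.0, 0.0, 1.0],
    "queen": [1.0, 1.0, 1.0],
    "apple": [-1.0, 0.3, 0.1],
}
answer = solve_analogy(toy_vectors, "man", "woman", "king")
print(answer)
```

Accuracy on an analogy benchmark is then the fraction of questions for which the returned word matches the expected one.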

Another intrinsic word vector evaluation

Closest words to “Sweden” (cosine similarity)
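
A nearest-neighbor list like this can be produced by ranking every other word by cosine similarity to the query vector. The sketch below uses invented two-dimensional vectors purely to show the ranking step; real lists come from trained embeddings.

```python
import math


def nearest_neighbors(vectors, query, top_n=3):
    """Rank all other words by cosine similarity to the query word."""
    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        return dot / (math.hypot(*u) * math.hypot(*v))

    q = vectors[query]
    scored = [(cosine(q, v), w) for w, v in vectors.items() if w != query]
    scored.sort(reverse=True)  # highest similarity first
    return [(w, s) for s, w in scored[:top_n]]


# Hypothetical 2-D vectors; the values are made up for illustration.
toy = {
    "Sweden":  [0.90, 0.80],
    "Norway":  [0.85, 0.82],
    "Denmark": [0.80, 0.90],
    "banana":  [-0.70, 0.20],
}
for word, score in nearest_neighbors(toy, "Sweden", top_n=2):
    print(word, round(score, 3))
```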

Correlation evaluation
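
Word similarity benchmarks are typically scored by the rank correlation between the model's similarity scores and human similarity ratings for the same word pairs. A minimal Spearman correlation sketch, assuming no tied values (the shortcut formula below is only exact in that case) and using made-up scores:

```python
def ranks(values):
    """Rank positions (1 = smallest); assumes all values are distinct."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result


def spearman(xs, ys):
    """Spearman rank correlation via 1 - 6 * sum(d^2) / (n * (n^2 - 1))."""
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(xs), ranks(ys)))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


# Hypothetical ratings for four word pairs: human judgments vs. model cosine scores.
human = [9.0, 7.5, 3.0, 1.0]
model = [0.8, 0.9, 0.2, 0.1]
rho = spearman(human, model)
print(rho)
```

Only the ordering of the scores matters, so the human ratings and the cosine similarities do not need to be on the same scale.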

Word senses and word sense ambiguity

pike

Improving Word Representations Via Global Context And Multiple Word Prototypes (Huang et al. 2012)

Linear Algebraic Structure of Word Senses, with Applications to Polysemy

Extrinsic word vector evaluation

Course plan: coming weeks

Office Hours / Help sessions
