Paper: "Distributed Representations of Words and Phrases and their Compositionality"

Posted 2018-05-30 03:17:50

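Up front: I strongly recommend the blog posts below. The author explains the principles behind word2vec extremely clearly!
Blog link 1: https://www.cnblogs.com/pinard/p/7160330.html
Blog link 2: http://www.cnblogs.com/pinard/p/7243513.html
Blog link 3: http://www.cnblogs.com/pinard/p/7249903.html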

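The paper opens by describing the property it wants the learned word vectors to have: simple vector arithmetic should capture relations between words, for example Vec("Madrid") - Vec("Spain") + Vec("France") ≈ Vec("Paris"), meaning the result is closer to Vec("Paris") than to any other word vector.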
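To make the analogy concrete, here is a minimal sketch in Python. The 2-D vectors in `toy_vectors` are hand-picked, hypothetical values (not trained embeddings), and the helper names `cosine` and `analogy` are my own; the only point is to show the "subtract, add, then find the nearest word" procedure.

```python
import numpy as np

# Hypothetical toy embeddings, hand-picked for illustration only.
# Axis 0 loosely stands for "which country", axis 1 for "is a capital city".
toy_vectors = {
    "Spain":  np.array([1.0, 0.0]),
    "Madrid": np.array([1.0, 1.0]),
    "France": np.array([2.0, 0.0]),
    "Paris":  np.array([2.0, 1.0]),
    "Berlin": np.array([3.0, 1.0]),
}

def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def analogy(a, b, c, vectors):
    """Return the word whose vector is closest to vec(a) - vec(b) + vec(c)."""
    target = vectors[a] - vectors[b] + vectors[c]
    # Exclude the three query words themselves from the candidates.
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(vectors[w], target))

result = analogy("Madrid", "Spain", "France", toy_vectors)
print(result)
```

With real embeddings the dictionary would hold vectors of a few hundred dimensions learned from a corpus, but the lookup logic stays the same.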
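The structure of the Skip-gram model (which uses the center word to predict the 2c surrounding words) is shown in Figure 1 of the paper.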

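The training objective of the Skip-gram model is as follows. Given a sequence of training words w_1, w_2, w_3, ..., w_T, and with c denoting the window size, we want to maximize the average log probability:

$$\frac{1}{T}\sum_{t=1}^{T}\ \sum_{-c \le j \le c,\ j \ne 0} \log p(w_{t+j} \mid w_t)$$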

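1. The inner sum handles one word w_t acting as the center word: it scores how well w_t predicts its 2c surrounding words. The joint probability of those words is a product, and taking the log turns that product into a sum.
2. The outer sum then lets every word of the sequence w_1, w_2, w_3, ..., w_T take its turn as the center word and adds up the results; training looks for the parameters that maximize this total.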
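The two nested sums can be sketched in a few lines of Python. This is only an illustration under my own assumptions: `skipgram_objective` and `uniform_log_prob` are hypothetical names, the uniform model is a stand-in for a real p(w_{t+j} | w_t), and near the ends of the sequence I simply skip context positions that fall outside it (the formula itself does not say how to treat the boundaries).

```python
import math

def skipgram_objective(words, c, log_prob):
    """Average log probability: (1/T) * sum over t, sum over -c<=j<=c, j!=0."""
    T = len(words)
    total = 0.0
    for t in range(T):                  # outer sum: each word is the center once
        for j in range(-c, c + 1):      # inner sum: offsets inside the window
            if j == 0:
                continue                # the center word does not predict itself
            if not (0 <= t + j < T):
                continue                # assumption: ignore out-of-range positions
            total += log_prob(words[t + j], words[t])
    return total / T

# Hypothetical stand-in model: every context word is equally likely.
sentence = ["the", "quick", "brown", "fox"]
vocab_size = len(set(sentence))

def uniform_log_prob(context_word, center_word):
    return math.log(1.0 / vocab_size)

value = skipgram_objective(sentence, c=1, log_prob=uniform_log_prob)
print(value)
```

Swapping `uniform_log_prob` for a model with trainable parameters, and adjusting those parameters to push `value` up, is what training amounts to.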

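The authors then say that p(w_{t+j} | w_t) is computed with the softmax below, where v_w and v'_w are the "input" and "output" vectors of word w and W is the vocabulary size. Honestly, I couldn't make sense of this part from the paper alone.

$$p(w_O \mid w_I) = \frac{\exp\left({v'_{w_O}}^{\top} v_{w_I}\right)}{\sum_{w=1}^{W} \exp\left({v'_{w}}^{\top} v_{w_I}\right)}$$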
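For what it's worth, the paper's basic formulation defines p(w_O | w_I) as a softmax over inner products between an "input" vector for the center word and an "output" vector for every vocabulary word. Below is a rough NumPy sketch of that computation. The sizes `W` and `d`, the random matrices `V_in` and `V_out`, and the function name `softmax_prob` are all made up for illustration; real vectors would come out of training.

```python
import numpy as np

rng = np.random.default_rng(0)

W, d = 5, 3                        # hypothetical vocabulary size and vector dimension
V_in = rng.normal(size=(W, d))     # row w holds the "input" vector v_w
V_out = rng.normal(size=(W, d))    # row w holds the "output" vector v'_w

def softmax_prob(center_id, V_in, V_out):
    """Return p(w | center) for every word w in the vocabulary."""
    scores = V_out @ V_in[center_id]   # inner product v'_w . v_center for each w
    scores = scores - scores.max()     # shift for numerical stability; result unchanged
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum()

probs = softmax_prob(2, V_in, V_out)
print(probs, probs.sum())
```

The denominator sums over all W words, so the cost of one probability grows with the vocabulary size.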

However, with the help of blog link 2 listed at the top of this post, we can take a different route to understanding it.

Reposted from blog.csdn.net/u010995990/article/details/79803960
