ELMo（Embeddings from Language Models） --学习笔记 - 代码天地

ELMo（Embeddings from Language Models） --学习笔记

其他 2018-11-12 19:26:58 阅读次数: 0

版权声明：转载请声明转自Juanlyjack https://blog.csdn.net/m0_38088359/article/details/83904566

学习参考自：
（1）、ELMo 最好用的词向量《Deep Contextualized Word Representations》
（2）、吾爱NLP(5)—词向量技术-从word2vec到ELMo
（3）文本嵌入的经典模型与最新进展

1、ELMo简介

基于大量文本，ELMo模型从深层的双向语言模型（deep bidirectional language model）中的内部状态(internal state)学习而来。
ELMo的优势：
（1）ELMo能够学习到词汇用法的复杂性，比如语法、语义。
（2）ELMo能够学习不同上下文情况下的词汇多义性。
在这里插入图片描述

ELMo与word2vec最大的不同：
Contextual: The representation for each word depends on the entire context in which it is used.　
（即词向量不是一成不变的，而是根据上下文而随时变化，这与word2vec或者glove具有很大的区别）

引例：
举个例子：针对某一词多义的词汇w=“苹果”
文本序列1=“我买了六斤苹果。”
文本序列2=“我买了一个苹果 7。”
上面两个文本序列中都出现了“苹果”这个词汇，但是在不同的句子中，它们的含义显示是不同的，一个属于水果领域，一个属于电子产品领域，如果针对“苹果”这个词汇同时训练两个词向量来分别刻画不同领域的信息呢？答案就是使用ELMo。

它首先在大文本语料库上预训练了一个深度双向语言模型（biLM），然后把根据它的内部状态学到的函数作为词向量。实验表明，这些学到的词表征可以轻易地加入到现有的模型中，并在回答问题、文本蕴含、情感分析等 6 个不同的有难度的 NLP 问题中大幅提高最佳表现。实验表明显露出预训练模型的深度内部状态这一做法非常重要，这使得后续的模型可以混合不同种类的半监督信号。

2、ELMo原理

在这里插入图片描述

3、ELMo的使用方法

在这里插入图片描述

4、ELMo的安装

ELMo集成在了基于pytorch的allennlp这个工具包当中，所以我们要是用ELMo必须要先安装pytorch，然后再安装allennlp。
安装请参照官网指示。

注意：allennlp这个包目前不支持windows，所以要使用allennlp必须要在linux上进行安装使用。代码如下：

conda install pytorch -c pytorch
pip3 install torchvision
pip install allennlp

猜你喜欢

转载自blog.csdn.net/m0_38088359/article/details/83904566

ELMo（Embeddings from Language Models） --学习笔记

【NLP-13】ELMo模型（Embeddings from Language Models）

转：Language Models as Knowledge Embeddings

【提示学习】AUTOPROMPT: Eliciting Knowledge from Language Models with Automatically Generated Prompts

深度学习论文: Learning Transferable Visual Models From Natural Language Supervision

Sequence Models(Week2)---Natural Language Processing & Word Embeddings

Object constraint language for code generation from activity models

clip:learning transferable visual models from natural language supervision

CLIP : Learning Transferable Visual Models From Natural Language Supervision

Fine-Tuning Language Models from Human Preferences

【论文视频】Clip：Learning Transferable Visual Models From Natural Language Supervision【多模态，对比学习，迁移学习】

论文解读：从自然语言监督学习可转移视觉模型Learning Transferable Visual Models From Natural Language Supervision

Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in GEC翻译

【论文&模型学习】从自然语言监督中学习可迁移视觉 CLIP（Learning Transferable Visual Models From Natural Language Supervision）

Coursera, Deep Learning 5, Sequence Models, week2, Natural Language Processing & Word Embeddings

【NLP】Conditional Language Models

The rise of language models

【论文&模型讲解】CLIP（Learning Transferable Visual Models From Natural Language Supervision）

CLIP论文翻译、Learning Transferable Visual Models From Natural Language Supervision翻译

Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community

吴恩达ChatGPT《Finetuning Large Language Models》笔记

Rasa 3.x 学习系列-Benchmarking Language Models

【GPU Gems 学习笔记】Effective Water Simulation from Physical Models

cs224n学习笔记L6: Language models and RNNs

Adapting Language Models to Compress Contexts

Challenges and Applications of Large Language Models

A Survey of Large Language Models Attribution

Large Language Models in Finance: A Survey

Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning

Python和TensorFlow2实现ELMO（Embedding From Language Model）模型，并对源码做了一些改进

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)