[Paper Reading Notes] Linguistic Knowledge and Transferability of Contextual Representations

Category: Other. Posted 2019-06-23 22:43:31.

Copyright notice: This is an original article by the blogger and may not be reproduced without the blogger's permission. https://blog.csdn.net/cskywit/article/details/89428157

The paper was released as an arXiv preprint in 2019.

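The paper studies the linguistic knowledge and transferability of contextualized word representations through sixteen different probing tasks. Features from pretrained contextualizers are sufficient for high performance on a broad range of NLP tasks. For tasks that need specific information the contextual word representations do not capture, learning task-specific contextual features helps encode the required knowledge. An analysis of transferability patterns across contextualizer layers further shows that the lowest layer of an LSTM encodes the most transferable features, whereas for the Transformer the middle layers are the most transferable. Higher LSTM layers are more task-specific (and therefore less general), while Transformer layers do not show the same monotonic increase in task specificity. Prior work suggested that higher contextualizer layers explicitly encode higher-level semantic information. Instead, it seems that certain high-level semantic phenomena happen to be useful for the contextualizer's pretraining task, which leads to their presence in higher layers. Finally, bidirectional language model pretraining yields representations that are generally more transferable than those from eleven other candidate pretraining tasks.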
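To make the idea of a probing task concrete: the contextualizer is kept frozen, a lightweight classifier is trained on the representations from one layer, and its held-out accuracy is read as a measure of how much task-relevant information that layer exposes. The sketch below is only an illustration under stated assumptions, not the paper's setup. It uses a linear softmax probe, synthetic vectors in place of real LSTM or Transformer activations, and hypothetical names (`make_synthetic_layers`, `layer_informative`, `layer_uninformative`) and hyperparameters chosen for the example.

```python
import numpy as np


def train_linear_probe(features, labels, num_classes, epochs=300, lr=0.1):
    """Fit a softmax-regression probe on frozen features with gradient descent."""
    n, d = features.shape
    weights = np.zeros((d, num_classes))
    bias = np.zeros(num_classes)
    one_hot = np.eye(num_classes)[labels]
    for _ in range(epochs):
        logits = features @ weights + bias
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        grad = (probs - one_hot) / n  # gradient of mean cross-entropy w.r.t. logits
        weights -= lr * (features.T @ grad)
        bias -= lr * grad.sum(axis=0)
    return weights, bias


def probe_accuracy(features, labels, weights, bias):
    """Fraction of tokens whose label the probe predicts correctly."""
    predictions = (features @ weights + bias).argmax(axis=1)
    return float((predictions == labels).mean())


def make_synthetic_layers(num_tokens=600, dim=16, num_classes=3, seed=0):
    """Synthetic stand-ins for per-layer token vectors (assumption, not real data)."""
    rng = np.random.default_rng(seed)
    labels = rng.integers(0, num_classes, size=num_tokens)
    class_means = rng.normal(scale=3.0, size=(num_classes, dim))
    # One "layer" encodes the label linearly, the other is pure noise.
    informative = class_means[labels] + rng.normal(size=(num_tokens, dim))
    uninformative = rng.normal(size=(num_tokens, dim))
    layers = {"layer_informative": informative, "layer_uninformative": uninformative}
    return layers, labels


def probe_all_layers(layers, labels, num_classes=3, train_fraction=0.8):
    """Train one probe per layer and report held-out accuracy for each."""
    split = int(len(labels) * train_fraction)
    results = {}
    for name, feats in layers.items():
        # Standardize with training-split statistics only; the features stay frozen.
        mean = feats[:split].mean(axis=0)
        std = feats[:split].std(axis=0) + 1e-8
        scaled = (feats - mean) / std
        w, b = train_linear_probe(scaled[:split], labels[:split], num_classes)
        results[name] = probe_accuracy(scaled[split:], labels[split:], w, b)
    return results


if __name__ == "__main__":
    layers, labels = make_synthetic_layers()
    for name, acc in probe_all_layers(layers, labels).items():
        print(f"{name}: held-out accuracy = {acc:.3f}")
```

Comparing the per-layer accuracies is the same kind of reading the layer-wise transferability analysis relies on: a layer whose frozen features support a simple probe well is one that exposes the relevant information directly.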
