Deep Syntax-Semantics Model（2020 EMNLP） - 代码天地

Deep Syntax-Semantics Model（2020 EMNLP）

其他 2021-02-28 10:16:12 阅读次数: 0

在这里插入图片描述

 Improving Text Understanding via Deep Syntax-Semantics Communication

动机

Syntax-Tree model与sequential semantic model相结合，提高下游任务性能。

Introduction

句子中句法和语义的比较。相同的颜色表示相同(相似)的语义目标。

在这里插入图片描述

Model

多层结合模型
在这里插入图片描述

定义

句子 $S$ = { $w_1$ ,…, $w_n$ }，对应的sequential 表示：
在这里插入图片描述
树表示：

Sequential Encoder

Bi-LSTM：
在这里插入图片描述

Tree Encoder（TreeLSTM or GCN）

TreeLSTM：Bi-LSTM：
在这里插入图片描述
两个方向连接：

GCN：

$N$ $（$ $j$ $）$ 表示邻居节点，取GCN最后一层的输出作为树表示：

Deep Communication Model

将Sequential encoder 和Tree encoder视为一个完整的unit。
在这里插入图片描述

Local Interaction

Local Interaction的动机是鼓励sequential encoder和Tree encoder 从彼此的信息传播模式中学习更多（attenton）。
首先，当前步骤 $t$ 的sequential encoder 中的每个节点将上一时间步骤的相邻节点作为附加输入:
在这里插入图片描述
普通的attention：

$n$ $b$ $s$ 表示邻居
同样，Tree encoder 亦是如此：

Sentence-level Global Propagation

将 Deep Communication Model中的 $h$ $^a$ $^l$ $^l$ $_i$ 更新为：
$h$ $^a$ $^l$ $^l$ $_i$ = [ $h$ $^s$ $^e$ $^q$ $^，$ $^t$ $_i$ , $h$ $^t$ $^r$ $^e$ $^e$ $^，$ $^t$ $_i$ ]
采用一个gate机制（只有输入门）进行global 传播：
在这里插入图片描述
上面带横线的 $h$ 表示ungated 数值。

扫描二维码关注公众号，回复： 12559740 查看本文章

Decoding and Training

Inner - attention:
在这里插入图片描述

如果是自然语言推理任务：
两个句子被表示：

然后输出一个概率。
如果是分类任务，直接对输出softmax，取最大值作为被分类类别：

LOSS:交叉熵+ $L_2$ 正则化
在这里插入图片描述

冷启动

为了避免冷启动训练，分别预训练独立的sequential encoder和tree encoder，然后在step 0 将它们的参数作为初始状态 $h_0$ 。

结果

event factuality prediction（EFP）：事件真实性预测
relation classification for drug-drug interaction（Rel）：药物相互作用的关系分类
semantic role labeling （SRL）：语义角色标注
在这里插入图片描述
自然语言推理任务：

Ablation Study

原文用的GloVe（88.2），实验证明用预训练模型对性能有大幅度提升。
在这里插入图片描述

猜你喜欢

转载自blog.csdn.net/qq_43390809/article/details/111588792

Deep Syntax-Semantics Model（2020 EMNLP）

Transformer for ranker（EMNLP 2020 ）

VD-BERT（统一视觉对话 2020 EMNLP）

跨语言检索的QA（google research EMNLP 2020）

DEEP & WIDE MODEL

2017emnlp-Author-aware Aspect Topic Sentiment Model to Retrieve Supporting Opinions from Reviews阅读笔记

Revisiting Self-Training for Few-Shot Learning of Language Model，EMNLP2021

Multi-Instance Multi-Label Learning Networks for Aspect-CategorySentiment Analysis（EMNLP 2020）

【论文笔记】Generating Radiology Reports via Memory-driven Transformer (EMNLP 2020)

ECCV2020(Oral) #开源项目# #GAN#《Rewriting a Deep Generative Model》重写深度生成模型

EMNLP - EMNLP 2023 样式文件和格式

【NeurIPS 2020】Deep Evidential Regression

EMNLP - 征集系统演示

Wide & deep Model：从Google到华为

深度学习之Deep Image CTR Model

Lecture 17 - Unsupervised Learning - Deep Generative Model

Model-Reuse Attacks on Deep Learning Systems

Deep Generative Model1--VAE

Model Space Exploration with Deep Neural Networks: An E

【笔记】TinyBERT(EMNLP2019)

EMNLP 2021 Transformers论文合集

EMNLP -- Call for Main Conference Papers

Deep RL Bootcamp Lecture 9 Model-based Reinforcement

【论文阅读笔记】---《A Survey of Model Compression and Acceleration for Deep Neural Networks》

推荐系统——A Hybrid Collaborative Filtering Model with Deep Structure for Recommender Systems

A deep tree-based model for software defect prediction

论文阅读：《A Deep Relevance Model for Zero-Shot Document Filtering》

The Wide and Deep Learning Model（译文+Tensorlfow源码解析）

论文阅读 | A Deep Relevance Matching Model for Ad-hoc Retrieval

A Survey of Model Compression and Acceleration for Deep Neural Network时s

今日推荐

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

周排行

curl的POST请求，封装方法

8.1.1. Integer Types

Java基础 Day05(个人复习整理)

Python - Django - 中间件 process_exception

小L的试卷

【Shell编程】（函数）判断用户是否存在

python(css样式)

spring ant path 匹配原则 - 【笔记】

《JavaScript与JScript从入门到精通》(美)James.Jaworski.中译本.扫描版.pdf

Eclipse运行带参数的java程序

每日归档

更多

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)