End to End Memory network

其他 2019-03-06 09:51:03 阅读次数: 0

版权声明：本文为博主原创文章，未经博主允许不得转载。 https://blog.csdn.net/LaineGates/article/details/79140583

关键词

End2End, Memory Networks, Multiple hops

来源

arXiv 2015.03.31 (published at NIPS 2015)

特色

设计了全新网络，相对于LSTM，以词为单位的时序，memory network是以句子为单位。

解决方案

原图
这里写图片描述
加备注图

计算过程

按原图

lookup词表A获得句子向量表示，
$m_i=Ax_i$ , $i$ 大小是memory size
计算attention，或者说计算输入的权重
$p_i = softmax(u^T m_i)$

将输出乘权重，得到最终的输出o
输出的嵌入向量
$c_i = C out_i$ , $i$ 大小是memory size
最终输出嵌入向量
$o = \sum_{i} p_i c_i$
查询的嵌入向量
$u=Bq$
预测结果
$\hat{a}=softmax(W(o+u))$

按实现代码

计算过程与原图不一致，我按论文的实现代码做了标注，参见备注图。
输入sentences和query时，都有矩阵TA和TB矩阵
即
$A_{in}=Ax_i +T_Ax_i$ , i代表句子，长度固定为memory size
$A_{out}=A_{in} H_{last}$ , H代表隐藏层, $A_{out}可看作m_i$
$p_i=softmax(A_{out})$
$B_{in}=Bq + T_Bq$
$B_{out}=p B_{in}$
$C_{out}=H_{last} B_{out}$
$D_{out}=C_{out} B_{out}$
最后，保存 $D_{out}$ 为新的Hidden

多层网络

原文提供两种方式。
第一种是邻接，即 $A_{k+1}=C_k$ ，依次递推
第二种是类似于 RNN 中共享权重的模式， $A_1=A_2=…=A_k$ ， $C_1=C_2=…=C_k$ 。
其余与单层网络一致。

参考代码

facebook实现，使用Lua语言
网友实现，使用tensorflow

猜你喜欢

转载自blog.csdn.net/LaineGates/article/details/79140583

End to End Memory network

End-To-End Memory Network 学习整理

记忆网络模型系列之End to End Memory Network

【记忆网络 2 】 End-to-End Memory Network

记忆网络模型End-to-End Memory Network论文阅读笔记

End-To-End Memory Networks 论文阅读

HybridNets: End-to-End Perception Network

end=‘’

end

end()

The End

end to end

[转载]记忆网络之End-To-End Memory Networks

End-to-end detection-segmentation network with ROI convolution

【阅读笔记】《An End-to-End Network for Panoptic Segmentation》

in the end，at the end， at an end的用法区别

什么是end to end 学习

Towards End-to-end

End-to-end Learning

深度之眼Paper带读笔记NLP.20：End-to-End Memory Networks

CRT detected that the application wrote to memory after end of heap buffer

RPAN：An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos

ICCV 2017 《Towards End-to-End Text Spotting with Convolutional Recurrent Neural Network》论文笔记

白翔2018Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shap

Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car论文笔记

A Network-based End-to-End Trainable Task-oriented Dialogue System

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

论文学习：《A network-based end-to-end trainable task-oriented dialogue system》

Dynamic Fusion Network for Multi-Domain End-to-end Task-OrientedDialog

Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)