Transformer【Attention is all you need】 - 代码天地

Transformer【Attention is all you need】

其他 2018-12-31 15:31:19 阅读次数: 0

前言

Transfomer是一种encoder-decoder模型，在机器翻译领域主要就是通过encoder-decoder即seq2seq，将源语言(x1, x2 ... xn) 通过编码，再解码的方式映射成（y1, y2 ... ym), 之前的做法是用RNN进行encode-decoder,但是由于RNN在某一时间刻的输入是依赖于上一时间刻的输出，所以RNN不能并行处理，导致效率低效，而Transfomer就避开了RNN，因此encoder-decoder效率高。

Transformer

从一个高的角度来看Transformer，它就是将源语言转换成目标语言

打开Transformer单元，我们会发现有两个部分组成，分别是encoders单元和decoders单元

而对于encoders单元，它是由六个encoder组成的，同样decoders单元，它也是由六个decoders组成。

对于每一个encoder，它们结构都一样的，但是权重不共享，每一个encoder的结构都是由两部分组成，分别是self-attention和feed forward neural network。

Transformer的处理流程是这样的：输入数据传给self-attention，然后selft-attention计算每一个位置的与其他位置的相关性，从而获得每一个位置的输出结果，该输出结果传给FFNN，得到第一个encoder的输出z_1，z₁作为第二个encoder的输入，步骤如上，直到最后一个encoder输出 ouput。

该输出ouput，在传给decoder，大致过程和encoder一致，有些许差异，稍后分析。

具体示例：

参考：

https://jalammar.github.io/illustrated-transformer/

猜你喜欢

转载自www.cnblogs.com/zhaopAC/p/10202071.html

Transformer【Attention is all you need】

Transformer：Attention Is All You Need

Transformer —— attention is all you need

Attention Is All You Need（Transformer ）

transformer(attention is all you need)

【Transformer】Attention Is All You Need

Attention Is All You Need（Transformer）原理小结

bert之transformer（attention is all you need）

Attention is all you need-详解Transformer

Attention is all you need中Transformer方法

【笔记】Transformer 框架：Attention is all you need

【论文阅读】Attention is all you need（Transformer）

Transformer 论文精读——Attention Is All You Need

Attention is All You Need（Transformer入门）

Transformer-《Attention Is All You Need》

Attention is all you need

Attention all you need

《Attention Is All You Need》

【论文解读】Attention Is All You Need（Transformer and Self-Attention）

Transformer--Attention is All You Need (推荐--非常详细)

《Attention is All You Need》论文理解Transformer

论文笔记Transformer:Attention is all you need

【Transformer开山之作】Attention is all you need原文解读

大语言模型之一 Attention is all you need ---Transformer

【笔记记录】Transformer架构（Attention is all you need）

读懂「Attention is All You Need」|

对Attention is all you need 的理解

Attention is All You Need 理解

Attention is All You Need -- 浅析

paper:Attention Is All You Need

今日推荐

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

周排行

[编程题]学英语

[codeforces 1288A] Deadline 约数+模

Python的web开发

Docker在Centos 7上的部署

python编码

解决Ubuntu16.04 fatal error: json/json.h: No such file or directory

mysql并发插入

rest接口如何适应jsonp的方案

linux 终端上网设置

高数——等号两边同时求导、积分的解释

每日归档

更多

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)