关于self-attention - 代码天地

关于self-attention

编程语言 2023-07-01 06:32:51 阅读次数: 0

自用，笔记整理。

self-attention模型输入的xi先做embedding得到ai每个xi都分别乘上三个不同的w得到q、k、v。

其中：

拿每个qi去对每个ki做点积得到a1,i，其中d是q和k的维度。

再把a1,i经过一个Soft-max之后得到

接下来把得出第一个输出b1同理可得到所有bi

那么self attention是这么做平行化的呢？

将a穿起来合并成矩阵I与wq相乘，得到q们，组成矩阵Q，同理得到K,V

对于a1,1只要将矩阵和矩阵相乘就行。然后对每一列做一个soft-max得到带帽的a矩阵

最后将带帽a与所有v构成的矩阵V相乘即可输出。

总结：

self-attention的变形——Multi-head Self-attention

Multi-head Self-attention跟self-attention一样都会生成q、k、v，但是Multi-head Self-attention会再将q、k、v分裂出多个q1,2（这里举例分裂成两个），然后它也将q跟k去进行相乘计算，但是只跟其对应的k、v进行计算，比如q1,1只会与k1,1 、k2,1进行运算，然后一样的乘以对应的v得到输出b1,1。

猜你喜欢

转载自blog.csdn.net/weixin_43681559/article/details/129429266

关于self-attention

Attention与Self-Attention

Self-Attention（什么是Self-Attention）

Self-attention详解

Self-Attention与Transformer

Self-attention

Self-attention & Transformer

Attention 和self-attention

Self-Attention GAN 中的 self-attention 机制

Transformer中的Self-Attention

Self-Attention 和 Transformer

self-attention与softmax的推导

self-attention与Transformer补充

On the Integration of Self-Attention and Convolution

self-attention学习笔记

Self-Attention运行过程

self-attention的通俗解释

【AI】12_Attention and Self-Attention

NLP 3.4 Attention，self-attention

浅谈Attention与Self-Attention的前世今生

self-attention和cross-attention

attention,self-attention,multihead attention,Transformer【亟待解决】

自注意力(self-attention)

SAGAN——Self-Attention Generative Adversarial Networks

基于self-attention检测lstm后门

学习笔记（二）__Self-Attention及Transformer

NLP入门（4）— Self-attention & Transformer

Self-Attention Generative Adversarial Networks

李宏毅self-attention学习

自注意力（Self-Attention）

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)