# Copyright notice: original post by the author; reproduction requires the author's permission. https://blog.csdn.net/guotong1988/article/details/86502457
# For summarizing a set of vectors into a single vector
class LinearSelfAttn(nn.Module):
    """Self attention over a sequence:

    * o_i = softmax(Wx_i) for x_i in X.

    Scores every position with a shared linear layer and normalizes the
    scores with a softmax, zeroing out masked (padding) positions.
    """

    def __init__(self, input_size):
        super(LinearSelfAttn, self).__init__()
        # One scalar attention score per position: hdim -> 1.
        self.linear = nn.Linear(input_size, 1)

    def forward(self, x, x_mask):
        """Compute attention weights over the sequence.

        Args:
            x: [batch, len, hdim] sequence of hidden states.
            x_mask: [batch, len] mask; nonzero/True marks positions to
                exclude from the softmax (presumably padding — confirm
                against the caller).

        Returns:
            alpha: [batch, len] attention weights summing to 1 over len.
        """
        # `my_dropout_p` is a module-level constant defined elsewhere in
        # this file. Qualified as F.dropout for consistency with F.softmax.
        x = F.dropout(x, p=my_dropout_p, training=self.training)
        # Flatten to [batch*len, hdim] so the linear layer scores all
        # positions in one call, then reshape back to [batch, len].
        x_flat = x.contiguous().view(-1, x.size(-1))
        scores = self.linear(x_flat).view(x.size(0), x.size(1))
        # Fixed: mask the tensor directly instead of mutating through the
        # deprecated `.data` attribute, which silently bypasses autograd.
        scores = scores.masked_fill(x_mask, -float('inf'))
        alpha = F.softmax(scores, dim=1)
        return alpha  # [batch, len]
# bmm: batch matrix multiplication
# unsqueeze: insert a singleton dimension
# squeeze: drop a singleton dimension
def weighted_avg(x, weights):
    """Collapse a sequence of vectors into one vector per batch element.

    Args:
        x: [batch, len, d] sequence of vectors.
        weights: [batch, len] per-position weights (e.g. attention).

    Returns:
        [batch, d] weighted sum over the len dimension.
    """
    # View the weights as a [batch, 1, len] row vector, then batch-multiply:
    # [batch, 1, len] @ [batch, len, d] -> [batch, 1, d] -> [batch, d].
    weight_rows = weights.unsqueeze(1)
    summary = weight_rows.bmm(x)
    return summary.squeeze(1)
# Usage:
# Score each position: [batch, sentence_len, hidden_dim] -> [batch, sentence_len]
sentence_weights = linear_self_attn(sentence_hiddens, sentence_mask)
# Collapse to one summary vector per sentence: [batch, hidden_dim]
sentence_avg_hidden = weighted_avg(sentence_hiddens, sentence_weights)