torch.nn.Linear()函数的理解 - 代码天地

torch.nn.Linear()函数的理解

其他 2019-06-22 13:50:23 阅读次数: 0

import torch

x = torch.randn(128, 20) # 输入的维度是（128，20）
m = torch.nn.Linear(20, 30) # 20,30是指维度
output = m(x)
print('m.weight.shape:\n ', m.weight.shape)
print('m.bias.shape:\n', m.bias.shape)
print('output.shape:\n', output.shape)

# ans = torch.mm(input,torch.t(m.weight))+m.bias 等价于下面的
ans = torch.mm(x, m.weight.t()) + m.bias
print('ans.shape:\n', ans.shape)

print(torch.equal(ans, output))
1
2
3
4
5
6
7
8
9
10
11
12
13
14
m.weight.shape:
torch.Size([30, 20])
m.bias.shape:
torch.Size([30])
output.shape:
torch.Size([128, 30])
ans.shape:
torch.Size([128, 30])
True
1
2
3
4
5
6
7
8
9
为什么 m.weight.shape = (30,20)?

答：因为线性变换的公式是：

y=xAT+b y=xA^T+b
y=xA
T
+b

先生成一个（30，20）的weight，实际运算中再转置，这样就能和x做矩阵乘法了
---------------------
作者：m0_37586991
来源：CSDN
原文：https://blog.csdn.net/m0_37586991/article/details/87861418
版权声明：本文为博主原创文章，转载请附上博文链接！

猜你喜欢

转载自www.cnblogs.com/jfdwd/p/11068544.html

torch.nn.Linear()函数的理解

torch.nn.Linear()函数理解

torch.nn.Linear

torch.nn.Linear 笔记

torch.nn.linear函数具体使用案例

pytorch api torch.nn.Linear

关于torch.nn.Linear的笔记

torch.nn.Linear的使用方法

torch.nn.Linear和torch.nn.MSELoss

pytorch中的神经网络子模块(线性模块)——torch.nn.Linear

nn.linear()函数

torch.nn.Embedding理解

torch.nn.Parameter理解

torch.nn.MSELoss()函数

pytorch nn.Modlue及nn.Linear 源码理解

pytorch系列： nn.Modlue及nn.Linear 源码理解

torch.nn.conv3d理解

Pytorch：torch.nn.Parameter理解

pytorch-torch.nn-激活函数

torch.nn.functional.pad()函数的使用

【Python/torch】torch.nn.functional.interpolate()函数解析

[转载]Pytorch中nn.Linear module的理解

对于torch.nn.AdaptiveAvgPool2d()自适应平均池化函数的一些理解

Pytorch中torch.nn.Conv3D、torch.nn.Conv2D函数详解

「Deep Learning」理解Pytorch中的「torch.nn」

PyTorch之 torch.nn.Embedding 词嵌入层的理解

nn.linear()的用法

torch.squeeze()函数的理解

torch.nn.functional.conv2d 函数详解

torch.nn.Conv2d()函数详解

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)