论文笔记：残差网络 Deep Residual Learning for Image Recognition

其他 2020-08-04 20:53:53 阅读次数: 0

论文笔记: Deep Residual Learning for Image Recognition

目标：利用残差网络是的训练更加简单

面对的问题：

degradation Problem：当添加的网络层次变多，精确度逐渐饱和，网络层次将趋近饱和。

Intuition：

文中的想法是将堆叠的感知器学习原有输出的残差。

具体表示是：假设 $\mathcal{H}(\mathbf{x})$ 为表示某几层感知器的， $x$ 是输入。那么我们令 $\mathcal{F}(\mathbf{x}):=\mathcal{H}(\mathbf{x})-\mathbf{x}$

那么原来的输出将变成： $\mathcal{F}(\mathbf{x})+\mathbf{x} =\mathcal{H}(\mathbf{x})$ , 让感知层用来学习 $\mathcal{F}$

尽管两者将同时迭代到需要的方程，但两者的难易程度不同。

我们假设后者比前者容易。

在这里插入图片描述

如图中所示，
$\mathbf{y}=\mathcal{F}\left(\mathbf{x},\left\{W_{i}\right\}\right)+\mathbf{x}$
其中 $\mathcal{F}=W_{2} \sigma\left(W_{1} \mathbf{x}\right)$ ， $\sigma$ 便是ReLU

如果 $\mathbf{x}$ and $\mathcal{F}$ 不是相同的纬度，那遍让 $\mathbf{x}$ 乘以一个矩阵 $W_s$ :
$\mathbf{y}=\mathcal{F}\left(\mathbf{x},\left\{W_{i}\right\}\right)+W_{s} \mathbf{x}$
即使纬度相同，也可以乘以一个方阵。但是作者认为单位矩阵就足够解决degradation problem.

扫描二维码关注公众号，回复： 11512380 查看本文章

关于 $\mathcal{F}$ 的选取，作者认为可以是各种样子，可以是很多层，但如果是只有一层，就与线性层没有差别，并不能看出什么优势。

网络结构

普通结构

Baseline 由VCG网络而来。大部分的过滤器由3x3构成，而且都服从以下两条规则：

(i) 如果输出的特征图相同，那么过滤器数量不变

(ii)如果输出的特征图纬度减半，那么过滤器数量翻倍

残差网络

在普通结构上使用残差结构。如果输入输出纬度相同，那我们直接使用单位矩阵。
$\mathbf{y}=\mathcal{F}\left(\mathbf{x},\left\{W_{i}\right\}\right)+\mathbf{x}$
如果维度（图中虚线部分）缩减，将有两种原则：

（A）捷径的部分（x）仍然使用单位矩阵。缺失的部分添加0来解决，这种方式没有增加多余的参数

（B）用（2）式中的方式，将 $\mathbf{x}$ 映射到相同纬度上来

对于两种方法，如果残差网络经过了纬度变换的时候，他们的步长为2（也就是说每种维度的特征图各取一层）

在这里插入图片描述

最后的实验证明两种方式（A）（B）并没有太多表现上的区别

猜你喜欢

转载自blog.csdn.net/ArchibaldChain/article/details/107747029

论文笔记：残差网络 Deep Residual Learning for Image Recognition

《Deep Residual Learning for Image Recognition》残差网络 -- 解析笔记

残差网络(Deep Residual Learning for Image Recognition)

论文-Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition笔记

Deep Residual Learning for Image Recognition 笔记

Deep Residual Learning for Image Recognition 论文笔记

《ResNet-Deep Residual Learning for Image Recognition》论文笔记

论文笔记：Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition

[深度学习]Deep Residual Learning for Image Recognition(ResNet,残差网络)阅读笔记

论文阅读(二)ResNet(Deep Residual Learning for Image Recognition)笔记

ResNet论文阅读---《Deep Residual Learning for Image Recognition》

[论文理解]Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition 论文学习

Deep Residual Learning for Image Recognition----ResNet论文阅读

ResNet来源论文《Deep Residual Learning for Image Recognition》读后总结

ResNet论文详解：《Deep Residual Learning for Image Recognition》

论文阅读——ResNet，Deep Residual Learning for Image Recognition

论文阅读|ResNet：Deep Residual Learning for Image Recognition

Deep Residual Learning for Image Recognition论文翻译（非google翻译）

Deep Residual Learning for Image Recognition （ResNet）论文详细解读

ResNet(Deep Residual Learning for Image Recognition)

翻译：Deep Residual Learning for Image Recognition

ResNet: Deep Residual Learning for Image Recognition详解

Deep Residual Learning for Image Recognition（译）

Deep Residual Learning for Image Recognition（ResNet）阅读

Deep Residual Learning for Image Recognition(ResNet)

Paper | Deep Residual Learning for Image Recognition

ResNet-Deep Residual Learning for Image Recognition

今日推荐

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

《2024 年一季度互联网投融资运行情况》研究报告

报告：Django 仍然是 74% 开发者的首选

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

周排行

记一下去大梅沙的准备（2018-05-26）

Spring 注解事务

基于HTTP协议的客户端缓存

阿里云rds 备份和还原

[PHP] 几个拖慢 PHP 程序/API 运行速度的点

python 代码风格------------PEP8规则

js控制json生成菜单——自制菜单（一）

将字符串: 'k:1|k1:2|k2:3|k3:4 ' ,处理成 python 字典: {'k':1, 'k1':2, ...}

微信小程序转支付宝小程序

Qt551.窗口滚动条

每日归档

更多

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)