《Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning》笔记 - 代码天地

《Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning》笔记

其他 2018-09-29 17:41:12 阅读次数: 0

版权声明：本文为博主原创文章，未经博主允许不得转载。 https://blog.csdn.net/KangRoger/article/details/69488642

Inception结构有着良好的性能，且计算量低。Residual connection不同于传统网络结构，且在2015 ILSVRC取得冠军，它的性能和Inception-v3接近。作者尝试将Inception结构和Residual connection结合，提出了新的网络结构Inception-v4。Inception-v4在ImageNet分类比赛中，top-5的错误率为3.08%。

介绍

Residual connection使得训练更深的网络变得容易。Inception结构也使得网络变深。将Residual connection添加到Inception结构中，使得Inception结构得到Residual connection的“益处”，且保留计算的高效。

作者也尝试了不使用Residual connection，而是直接拓展Inception结构，使得它更深更宽。作者也设计了不用Residual connection版本的Inception-v4。

相关工作

卷积网络在图像识别领域已经十分流行，经典网络有AlexNet、VGGNet、GoogLeNet等。Residual connection的提出是用了训练更深的网络，但是作者发现不使用Residual connection也可以训练更新的网络，Residual connection并不是必要条件；只是使用了Residual connection会加快训练速度。

Inception结构最初由GoogLeNet引入，GoogLeNet叫做Inception-v1；之后引入了BatchNormalization，叫做Inception-v2；随后引入分解，叫做Inception-v3。

网络架构

Inception-v4共有四种结构，一种不包含Residual connection结构的；包含Residual connection结构的，根据包含Inception模块的不同又分为2种：Inception-ResNet-v1和Inception-ResNet-v2。

Inception-v4

下图是Inception-v4的结构：

Stem模块为：

下面分别为：Inception-A、Inception-B、Inception-C模块。

不同的Inception模块的连接，减小了feature map，却增加了filter bank。

35x35变为17x17模块，即Reduction-A

17x17变为8x8模块，即Reduction-B

Inception-ResNet

Inception-ResNet的两个版本，结构基本相同，只是细节不同。整体结构为：

Inception-ResNet的两个版本对应Inception-resnet-A、Inception-resnet-B、Inception-resnet-C略微不同。

其中Inception-ResNet-v1和Inception-ResNet-v2对应的Inception-resnet-A模块为：

其中Inception-ResNet-v1和Inception-ResNet-v2对应的Inception-resnet-B模块为：

其中Inception-ResNet-v1和Inception-ResNet-v2对应的Inception-resnet-C模块为：

Inception-ResNet的stem模块和Reduction-B模块也略微不同。Inception-ResNet-v1和Inception-ResNet-v2主要在于Reduction-A结构不同：

其中k,l,m,n表示filter bank size。

缩放Residuals

当卷积核个数超过1000时，训练将会变得不稳定，在训练的早期，网络“died”。这是缩小Residuals有助于稳定训练，缩小因子介于0.1到0.3。

He在训练Residual Net时也发现这个问题，提出了“two phase”训练。首先“warm up”，使用较小的学习率。接着再使用较大的学习率。

训练方法

使用Momentum + SGM，momentum=0.9。使用RMSProp，decay为0.9， $\epsilon=1.0$ 。

学习率为0.045，每2个epoch缩小为原理的0.94。

实验结果

猜你喜欢

转载自blog.csdn.net/KangRoger/article/details/69488642

Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning 论文笔记

《Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning》笔记

『Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning』论文笔记

GoogleNetV4 Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

Inception-v4，Inception-ResNet and the Impact of Residual Connections on Learning

Inception-V4, Inception-ResNet and the Impact of Residual Connections on Learning

【Network Architecture】Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning（转） Feature Extractor[Inception v4]

【论文阅读】Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning【v4, 2016】

How to design DL model(2):Inception(v4)-ResNet and the Impact of Residual Connections on Learning

Inception-V4和Inception-Resnet论文阅读和代码解析

CNN卷积神经网络之Inception-v4,Inception-ResNet

Inception v4, Inception-ResNet文章复现

学习笔记：inception V4 与resnet

Inception-v4 翻译及总结

【第62篇】Inception-v4

Inception-V1到Inception-V4

深度篇—— Deep Learning 经典网络 model 发展史(七) 细说 Inception-ResNet 结构和特点

Xception,Inception-ResNet,SENet(Squeeze-and-Excitation)

卷积神经网络（六）：谷歌的深度学习网络架构Inception与Inception-ResNet

Inception系列：Inception-v1（GooLeNet）、Inception-v2、Inception-v3、Inception-v4论文全面解读

比较新的网络模型：Inception-v3 ， ResNet， Inception-v4， Dual-Path-Net ， Dense-net ， SEnet ， Wide ResNet

Inception-Resnet-v1、Inception-Resnet-v2学习笔记（附Pytorch代码）

InceptionV4 Inception-ResNet 论文研读及Pytorch代码复现

《ResNet-Deep Residual Learning for Image Recognition》论文笔记

论文阅读(二)ResNet(Deep Residual Learning for Image Recognition)笔记

【图像分类】【深度学习】【Pytorch版本】Inception-ResNet模型算法详解

二、CNNs网络架构-卷积分离网络架构（VGGNet、GoogLeNet、GoogleNet v2、GoogleNet v3、GoogleNet v4、Inception-ResNet）

ResNet(Deep Residual Learning for Image Recognition)

ResNet: Deep Residual Learning for Image Recognition详解

Deep Residual Learning for Image Recognition（ResNet）阅读

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

基本数据类型封装类比较 Java源码解读(一) 8种基本类型对应的封装类型

JS实现无缝滚动上

深入解析HashMap原理（基于JDK1.8）

mysql的连接池

关于.htc

linux下的ubuntu12.04图形界面

【数论】好推不好记的扩展欧几里德

设备树详解

cscope + tags 简单设置

xml学习

每日归档

更多

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)