Training Very Deep Networks

Rupesh Kumar Srivastava
Klaus Greff
Jürgen Schmidhuber
The Swiss AI Lab IDSIA / USI / SUPSI
{rupesh, klaus, juergen}@idsia.ch

Abstract
Theoretical and empirical evidence indicates that the depth of neural networks
is crucial for their success. However, training becomes more difficult as depth
increases, and training of very deep networks remains an open problem. Here we
introduce a new architecture designed to overcome this. Our so-called highway
networks allow unimpeded information flow across many layers on information
highways. They are inspired by Long Short-Term Memory recurrent networks and
use adaptive gating units to regulate the information flow. Even with hundreds of
layers, highway networks can be trained directly through simple gradient descent.
This enables the study of extremely deep and efficient architectures.
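
As a rough illustration of the gating mechanism summarized above, the following minimal sketch (Python with NumPy; the layer width, weight initialization, and negative transform-gate bias are assumptions of this illustration, not the paper's exact configuration) combines a nonlinear transform H(x) with the unmodified input x through a transform gate T(x):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def highway_layer(x, W_H, b_H, W_T, b_T):
        # Candidate transformation of the input.
        H = np.tanh(x @ W_H + b_H)
        # Transform gate in (0, 1): how much of H(x) versus the raw input passes through.
        T = sigmoid(x @ W_T + b_T)
        # Gated mixture; the input dimensionality is preserved so layers can be stacked deeply.
        return H * T + x * (1.0 - T)

    # Stacking many such layers (sizes and initial values are hypothetical):
    d = 16
    x = np.random.randn(1, d)
    for _ in range(50):
        W_H, b_H = 0.1 * np.random.randn(d, d), np.zeros(d)
        W_T, b_T = 0.1 * np.random.randn(d, d), np.full(d, -2.0)  # negative bias initially favors carrying the input
        x = highway_layer(x, W_H, b_H, W_T, b_T)

Because the gate can stay close to zero, a layer can pass its input through essentially unchanged, which is the unimpeded information flow referred to above.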

1 Introduction & Previous Work
Many recent empirical breakthroughs in supervised machine learning have been achieved through
large and deep neural networks. Network depth (the number of successive computational layers) has
played perhaps the most important role in these successes. For instance, within just a few years, the
top-5 image classification accuracy on the 1000-class ImageNet dataset has increased from ∼84%
[1] to ∼95% [2, 3] using deeper networks with rather small receptive fields [4, 5]. Other results on
practical machine learning problems have also underscored the superiority of deeper networks [6]
in terms of accuracy and/or performance.
In fact, deep networks can represent certain function classes far more efficiently than shallow ones.
This is perhaps most obvious for recurrent nets, the deepest of them all. For example, the n bit
parity problem can in principle be learned by a large feedforward net with n binary input units, 1
output unit, and a single but large hidden layer. But the natural solution for arbitrary n is a recurrent
net with only 3 units and 5 weights, reading the input bit string one bit at a time, making a single
recurrent hidden unit flip its state whenever a new 1 is observed [7]. Related observations hold for
Boolean circuits [8, 9] and modern neural networks [10, 11, 12].
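
To make the recurrent parity argument concrete, here is a toy sketch of the state-flipping mechanism (not the exact 3-unit, 5-weight construction of [7]; the explicit flip below merely stands in for what that small recurrent net computes):

    def parity_rnn(bits):
        # A single recurrent 'hidden unit' that flips its state whenever a 1 is read.
        state = 0
        for b in bits:
            if b == 1:
                state = 1 - state  # flip on every observed 1
        return state  # 1 iff the number of 1s is odd

    assert parity_rnn([1, 0, 1, 1]) == 1  # three 1s: odd parity
    assert parity_rnn([1, 1, 0, 0]) == 0  # two 1s: even parity

The same loop handles any input length n, whereas the feedforward solution needs a hidden layer whose size grows with n.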
