Highway Networks (Training Very Deep Networks, NIPS 2015)

Why Deeper Networks?

The deeper, the better (setting aside computational complexity).
As the ResNet paper puts it: "Recent evidence [40, 43] reveals that network depth is of crucial importance."

Why Are Deeper Networks Harder to Train?

Again quoting the ResNet paper: "An obstacle to answering this question was the notorious problem of vanishing/exploding gradients [14, 1, 8]."
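To make the problem concrete, here is a toy sketch (mine, not from either paper; the width, depth, and 0.1 weight scale are arbitrary demo choices) that backpropagates through a plain stack of sigmoid layers and shows the input gradient collapsing toward zero:

```python
import torch

# Toy demonstration of vanishing gradients in a plain (non-highway) stack.
# Width 64, depth 50, and the 0.1 weight scale are arbitrary demo choices.
torch.manual_seed(0)
width, depth = 64, 50
x = torch.randn(1, width, requires_grad=True)
h = x
for _ in range(depth):
    h = torch.sigmoid(h @ (0.1 * torch.randn(width, width)))
h.sum().backward()
print(x.grad.norm())  # a vanishingly small number, many orders below 1
```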

How To Train Very Deep Networks?

  • Good initialization
  • Local competition may help train deeper networks [20, 21]
  • Skip connections [2, 22, 23, 24] (see the sketch after this list)
  • Multi-stage training [25]
  • Layer-wise training [26, 27]
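Highway Networks themselves belong to the skip-connection family: each layer computes y = T(x) * H(x) + (1 - T(x)) * x, where H is an ordinary nonlinear transform and the sigmoid "transform gate" T learns how much to transform versus simply carry the input. Below is a minimal sketch of one such layer (PyTorch; it assumes equal input and output width, and the ReLU for H and the -2.0 gate bias are illustrative choices consistent with the paper's negative-bias gate initialization):

```python
import torch
import torch.nn as nn

class HighwayLayer(nn.Module):
    """One highway layer: y = T(x) * H(x) + (1 - T(x)) * x."""

    def __init__(self, dim, gate_bias=-2.0):
        super().__init__()
        self.transform = nn.Linear(dim, dim)  # H: plain transform
        self.gate = nn.Linear(dim, dim)       # T: transform gate
        # A negative gate bias makes T(x) small at the start of training,
        # so the layer initially behaves close to the identity mapping.
        nn.init.constant_(self.gate.bias, gate_bias)

    def forward(self, x):
        h = torch.relu(self.transform(x))
        t = torch.sigmoid(self.gate(x))
        return t * h + (1.0 - t) * x

# Usage: stack many layers of equal width and train end to end.
block = nn.Sequential(*[HighwayLayer(64) for _ in range(10)])
y = block(torch.randn(8, 64))  # shape (8, 64)
```

Starting each layer near the identity is the key trick: gradients can initially flow through the carry path unimpeded, which is what the paper credits for making very deep stacks trainable with plain SGD.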

From the Highway Networks paper:
[2] C. Szegedy et al. Going deeper with convolutions. In CVPR, 2015.
[20] I. Goodfellow et al. Maxout networks. In ICML, 2013.
[21] R. K. Srivastava et al. Compete to compute. In NIPS, 2013.
[22] T. Raiko et al. Deep learning made easier by linear transformations in perceptrons. In AISTATS, 2012.
[23] A. Graves. Generating sequences with recurrent neural networks. arXiv:1308.0850, 2013.
[24] C.-Y. Lee et al. Deeply-supervised nets. In AISTATS, 2015.
[25] A. Romero et al. FitNets: Hints for thin deep nets. In ICLR, 2015.
[26] J. Schmidhuber. Learning complex, extended sequences using the principle of history compression. Neural Computation, 1992.
[27] G. E. Hinton et al. A fast learning algorithm for deep belief nets. Neural Computation, 2006.


From the ResNet paper:
[40] K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
[43] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In CVPR, 2015.
[1] Y. Bengio, P. Simard, and P. Frasconi. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 5(2):157–166, 1994.
[8] X. Glorot and Y. Bengio. Understanding the difficulty of training deep feedforward neural networks. In AISTATS, 2010.
[14] S. Hochreiter. Untersuchungen zu dynamischen neuronalen netzen. Diploma thesis, TU Munich, 1991.
