Deep learning II - II Optimization algorithms - RMSprop (Root Mean Square prop)均方根传递 - 代码天地

Deep learning II - II Optimization algorithms - RMSprop (Root Mean Square prop)均方根传递

其他 2018-06-24 05:13:45 阅读次数: 3

RMSprop

相较于gradient descent with momentum，RMSprop的思想是，对于梯度震动较大的项，在下降时，减小其下降速度；对于震动幅度小的项，在下降时，加速其下降速度。
通过使用指数加权平均计算得到 $S_{{\rm d}w},\ S_{{\rm d}b}$ ；使用他们来更新参数（如下图所示）

S_{d w} = β S_{d w} + (1 - β) d w^{2}

$S_{{\rm d}w} = \beta S_{{\rm d}w} + (1-\beta){\rm d}w^2$

S_{d b} = β S_{d b} + (1 - β) d b^{2}

$S_{{\rm d}b} = \beta S_{{\rm d}b} + (1-\beta){\rm d}b^2$

w := w - α \frac{d w}{\sqrt{S_{d w}} + ϵ}

$w := w - \alpha \frac{{\rm d}w}{\sqrt{S_{{\rm d}w}}+\epsilon}$

b := b - α \frac{d b}{\sqrt{S_{d b}} + ϵ}

$b := b - \alpha \frac{{\rm d}b}{\sqrt{S_{{\rm d}b}}+\epsilon}$

$\epsilon = 10^{-8}$ ，是为了保证分母不为零； ${\rm d}w^2$ 和 ${\rm d}b^2$ 指的是element-wise

猜你喜欢

转载自blog.csdn.net/zfcjhdq/article/details/80746066

Deep learning II - II Optimization algorithms - RMSprop (Root Mean Square prop)均方根传递

Deep learning II - II Optimization algorithms - learning rate decay 学习速率衰减

Deep learning II - II Optimization algorithms - Mini-batch gradient descent

Deep learning II - II Optimization algorithms - Adam (Adaptive Moment Estimation)自适应矩估计

Deep learning II - II Optimization algorithms - Gradient descent with momentum 动量梯度下降算法

Deep learning II - II Optimization algorithms - Exponentially weighted averages 指数加权平均

均方根误差RMSE（Root Mean Square Error）

优化梯度下降算法 Momentum、RMSProp(Root mean square propagation)和Adam( Adaptive Moment Estimation)

「Deep Learning」Note on RMSprop

Maximal Square II

Deep Learning 最优化方法之RMSProp

Optimization algorithm----Deep Learning

Optimization for Deep Learning Highlights in 2017

Deep learning II - I Practical aspects of deep learning - other regularization methods 其他正则化方法

Deep learning II - I Practical aspects of deep learning - Understanding dropout 理解随机失活正则化

Deep learning II - I Practical aspects of deep learning - Dropout regularization 随机失活正则化

Deep learning II - I Practical aspects of deep learning - 正则化如何较少过拟合

Deep learning II - I Practical aspects of deep learning - Gradient checking 梯度检查

Deep learning II - I Practical aspects of deep learning - Vanishing/Exploring gradients 梯度消失/爆炸

Deep learning II - I Practical aspects of deep learning - Normalizing inputs 输入归一化

Gentle Introduction to the Adam Optimization Algorithm for Deep Learning

Review of Deep Learning Algorithms for Object Detection

Improving Deep Neural Networks (Week2)---Optimization algorithms

Deep learning II - I Practical aspects of deep learning - Regularizing your neural network 神经网络范数正则化

优化算法optimization：RMSProp

Leetcode算法——59、螺旋矩阵II（square matrix II）

模板匹配--Image Alignment Algorithms - Part II

Deep learning II - III Batch Normalization - Normalizing activation in a network 激励函数输出归一化

李宏毅学习笔记18.Unsupervised Learning: Deep Generative Model (Part II)

均方误差（mean-square error, MSE）

今日推荐

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

【转】spring中对控制反转和依赖注入的理解

tms webcore 安装和使用

java程序员进阶相关书籍

SpringMVC接受请求参数、

如何保存训练好的机器学习模型

MyEclipse、Eclipse设置项目JDK的三个地方

商超行业微信小程序开发定制一般多少钱（行业技术人员解读）

Markdown编辑器语言——30分钟入门到到精通

Linux系统下MongoDB的简单安装与基本操作

Power Strings

每日归档

更多

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)