Introduction to Recurrent Neural Networks - 代码天地

Introduction to Recurrent Neural Networks

其他 2018-05-16 20:11:14 阅读次数: 2

What is RNN

The networks are recurrent because they performance same computations for all the elements of a sequence of input, and the output of each element dependents, in addition to current input, from all the previous commutations.

Why RNN

Sequential type information of the inputs
Video Analysis
Speech Recognition
Machine Translation
RNN have proved to have excellent performance in such problems

RNN Procedure

这里写图片描述

Sigmoid Gradient

这里写图片描述

The Vanish Gradient Problem

Consider the recurrent networks:

h_{t} = σ (U x_{t} + V h_{t - 1})

$h_t = \sigma(Ux_t+Vh_{t-1})$
then,

h_{3} = σ (U x_{3} + V (σ (U x_{2} + V (σ (U x_{1})))))

$h_3 = \sigma(Ux_3+V(\sigma(Ux_2+V(\sigma(Ux_1)))))$

\frac{\partial E_{3}}{\partial U} = \frac{\partial E_{3}}{\partial o u t_{3}} \frac{\partial o u t_{3}}{\partial h_{3}} \frac{\partial h_{3}}{\partial h_{2}} \frac{\partial h_{2}}{\partial h_{1}} \frac{\partial h_{1}}{\partial U}

$\frac{\partial E_3}{\partial U}=\frac{\partial E_3}{\partial out_3}\frac{\partial out_3}{\partial h_3}\frac{\partial h_3}{\partial h_2}\frac{\partial h_2}{\partial h_1}\frac{\partial h_1}{\partial U}$

LSTM Cell

这里写图片描述

Input Gate

$g = t a n h (b^{g} + x_{t} U^{g} + h_{t - 1} V^{g})$ $g = tanh(b^g+x_tU^g+h_{t-1}V^g)$
$i = σ (b^{i} + x_{t} U^{i} + h_{t - 1} V^{i})$ $i=\sigma(b^i+x_tU^i+h_{t-1}V^i)$
$o u t_{i} = g \circ i$ $out_i=g\circ i$
forget gate

$f = σ (b^{f} + x_{t} U^{f} + h_{t - 1} V^{f})$ $f = \sigma(b^f+x_tU^f+h_{t-1}V^f)$
$s_{t} = s_{t - 1} \circ f + g \circ i$ $s_t = s_{t-1}\circ f + g\circ i$
output gate

扫描二维码关注公众号，回复： 874545 查看本文章
$o = σ (b^{o} + x_{t} U^{o} + h_{t - 1} V^{o})$ $o = \sigma(b^o+x_tU^o+h_{t-1}V^o)$
$h_{t} = t a n h (s_{t}) \circ o$ $h_t = tanh(s_t)\circ o$

Reducing The Problem

\frac{\partial s_{t}}{\partial s_{t - 1}} = f

$\frac{\partial s_t}{\partial s_{t-1}} = f$

Reference

http://adventuresinmachinelearning.com/recurrent-neural-networks-lstm-tutorial-tensorflow/
Deep Learning with Tensorflow

猜你喜欢

转载自blog.csdn.net/volvet/article/details/80031566

Introduction to Recurrent Neural Networks

Recurrent Neural Networks Tutorial, Part 1 – Introduction to RNNs

RNN(Recurrent Neural Networks)

Recurrent Neural Networks——RNN

Gated Recurrent Neural Networks

Recurrent Neural Networks 简述

019 Recurrent Neural Networks

Introduction To Neural Networks

Recurrent Neural Networks, LSTM, GRU

Recurrent Neural Networks by Example in Python

Deep learning - Introduction to Neural Networks

RNN:The Unreasonable Effectiveness of Recurrent Neural Networks

Recurrent Neural Networks for Emotion Recognition in Video

Multi-Dimensional Recurrent Neural Networks

sp5.1 Recurrent Neural Networks

RNN(Recurrent Neural Networks)和LSTM

Joint Event Extraction via Recurrent Neural Networks

Attention and Augmented Recurrent Neural Networks【译文】

Lecture 6: Language Models and Recurrent Neural Networks

10_introduction_to_artificial_neural_networks

Introduction of Convolutional Neural Networks — PPT(交流使用)

[ML] 2. Introduction to neural networks

A Gentle Introduction to Graph Neural Networks阅读笔记

循环神经网络（RNN）Recurrent Neural Networks

On the difficulty of training Recurrent Neural Networks中RNN完美复现

循环神经网络(RNN, Recurrent Neural Networks)介绍

文章：Emotion Recognition From Speech With Recurrent Neural Networks

Relational recurrent neural networks 论文代码阅读及实现例子

[序列模型] Recurrent Neural Networks习题解析

CS231n: Lecture 10 | Recurrent Neural Networks

今日推荐

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

周排行

浏览器对同一域名进行请求的最大并发连接数

React Hook之自定义Hook

【转】MyBatis缓存机制

-Java-泛型

自动化测试常用脚本-发送邮件

LeetCode#859: Buddy Strings

java、Python处理字符串

第二篇の博客

Hadoop伪分布式环境安装

SQL Server进阶（十一）临时表、表变量

每日归档

更多

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)