Recurrent Neural Networks Tutorial, Part 1 – Introduction to RNNs

Other 2018-07-09 17:21:18


What is an RNN?

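The core idea behind RNNs is to make use of sequential information. In a traditional neural network we usually assume that all inputs (and outputs) are independent of each other, but for many real tasks that is a poor assumption. If we want to predict the next word in a sentence, for example, we had better know which words came before it.
The R in RNN stands for Recurrent: the network performs the same operation on every element of a sequence in order, and each output depends on the results of the previous computations. Another way to think about RNNs is that they have a "memory" that captures information about what has been computed so far. In theory an RNN can make use of information from arbitrarily long sequences, but in practice it is limited to looking back only a few steps.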

$x_t$ is the input at time step t. For example, $x_1$ could be a one-hot vector corresponding to the second word of a sentence (time steps are counted from 0).
$s_t$ is the hidden state at time step t. It is the "memory" of the network. $s_t$ is calculated from the previous hidden state and the input at the current step: $s_t = f(Ux_t + Ws_{t-1})$. The function f is usually a nonlinearity such as tanh or ReLU. $s_{-1}$, which is required to calculate the first hidden state, is typically initialized to all zeros.
$o_t$ is the output at step t. For example, if we wanted to predict the next word in a sentence, it would be a vector of probabilities across our vocabulary: $o_t = \mathrm{softmax}(Vs_t)$.
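The three definitions above can be sketched in plain Python as follows. This is only a minimal illustration, not the tutorial's own implementation: it assumes f is tanh, and the vocabulary size, hidden size, random weights, and word indices are all made-up toy values. The helper names (`matvec`, `one_hot`, `rnn_forward`, `random_matrix`) are hypothetical.

```python
import math
import random

def matvec(M, v):
    """Multiply matrix M (a list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def softmax(z):
    """Numerically stable softmax over a list of floats."""
    z_max = max(z)
    exps = [math.exp(z_i - z_max) for z_i in z]
    total = sum(exps)
    return [e / total for e in exps]

def one_hot(index, size):
    """Vector of zeros with a single 1.0 at position `index`."""
    v = [0.0] * size
    v[index] = 1.0
    return v

def rnn_forward(xs, U, V, W):
    """Apply s_t = tanh(U x_t + W s_{t-1}) and o_t = softmax(V s_t) to each x_t."""
    s_prev = [0.0] * len(W)  # s_{-1}, initialized to all zeros
    states, outputs = [], []
    for x_t in xs:
        pre = [a + b for a, b in zip(matvec(U, x_t), matvec(W, s_prev))]
        s_t = [math.tanh(p) for p in pre]
        o_t = softmax(matvec(V, s_t))
        states.append(s_t)
        outputs.append(o_t)
        s_prev = s_t  # the new state becomes the memory for the next step
    return states, outputs

def random_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; a stand-in for trained parameters."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
vocab_size, hidden_size = 5, 3                   # assumed toy sizes
U = random_matrix(hidden_size, vocab_size, rng)  # input  -> hidden
W = random_matrix(hidden_size, hidden_size, rng) # hidden -> hidden
V = random_matrix(vocab_size, hidden_size, rng)  # hidden -> output

sentence = [0, 3, 1, 4]                          # assumed word indices
xs = [one_hot(i, vocab_size) for i in sentence]
states, outputs = rnn_forward(xs, U, V, W)
```

Each entry of `outputs` is a probability vector over the toy vocabulary, and the same `U`, `V`, and `W` are reused at every step of the loop.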
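You can think of the hidden state $s_t$ as the memory of the network. $s_t$ captures information about what happened in the previous time steps. The output $o_t$ at each step is computed solely from the memory at that step.
Unlike a traditional neural network, an RNN shares the same parameters (U, V, W above) across all time steps. This reflects the fact that we are performing the same operation at every step, just with a different input each time. It greatly reduces the number of weights the network has to store and learn.
The procedure above produces an output at every time step, but depending on the application these outputs may not all be necessary. For example, when predicting the sentiment of a sentence we may only care about the final output.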
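To make the effect of parameter sharing concrete, the following sketch counts the weights in U, V, and W and compares them with a hypothetical network that keeps a separate copy of the three matrices for every time step. The sizes are assumed for illustration only, and bias terms are left out because the formulas above do not include them.

```python
def rnn_param_count(vocab_size, hidden_size):
    """Weights in U (hidden x vocab), W (hidden x hidden), and V (vocab x hidden)."""
    return (hidden_size * vocab_size
            + hidden_size * hidden_size
            + vocab_size * hidden_size)

def unshared_param_count(vocab_size, hidden_size, num_steps):
    """Hypothetical alternative: an independent U, W, V for each time step."""
    return num_steps * rnn_param_count(vocab_size, hidden_size)

vocab_size, hidden_size = 8000, 100  # assumed sizes, for illustration only

for num_steps in (5, 20, 50):
    shared = rnn_param_count(vocab_size, hidden_size)
    unshared = unshared_param_count(vocab_size, hidden_size, num_steps)
    print(f"steps={num_steps:3d}  shared={shared:,}  unshared={unshared:,}")
```

With sharing, the count stays fixed no matter how long the sequence is; without it, the count grows linearly with the number of steps.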

What can RNNs do?

Reposted from blog.csdn.net/frankzd/article/details/80006290
