Introduction to Recurrent Neural Networks

What is RNN

The networks are recurrent because they performance same computations for all the elements of a sequence of input, and the output of each element dependents, in addition to current input, from all the previous commutations.

Why RNN

  • Sequential type information of the inputs
    Video Analysis
    Speech Recognition
    Machine Translation
  • RNN have proved to have excellent performance in such problems

RNN Procedure

这里写图片描述

Sigmoid Gradient

这里写图片描述

The Vanish Gradient Problem

Consider the recurrent networks:

h t = σ ( U x t + V h t 1 )

then,
h 3 = σ ( U x 3 + V ( σ ( U x 2 + V ( σ ( U x 1 ) ) ) ) )

E 3 U = E 3 o u t 3 o u t 3 h 3 h 3 h 2 h 2 h 1 h 1 U

LSTM Cell

这里写图片描述

  • Input Gate

    g = t a n h ( b g + x t U g + h t 1 V g )

    i = σ ( b i + x t U i + h t 1 V i )

    o u t i = g i

  • forget gate

    f = σ ( b f + x t U f + h t 1 V f )

    s t = s t 1 f + g i

  • output gate

    扫描二维码关注公众号,回复: 874545 查看本文章
    o = σ ( b o + x t U o + h t 1 V o )

    h t = t a n h ( s t ) o

Reducing The Problem

s t s t 1 = f

Reference

猜你喜欢

转载自blog.csdn.net/volvet/article/details/80031566