Machine learning notes: seq2seq & attentioned seq2seq - Code World

Machine learning notes: seq2seq & attentioned seq2seq

Mobile 2023-09-30 21:03:46 views: null

1Seq2Seq

1.1 Introduction

For the sequence pair <X,Y>, our goal is to give the input sequence X and expect to generate the target sequence Y through the Encoder-Decoder framework.

Encoder encodes the input sequence X and converts the input sequence into an intermediate semantic representation C through non-linear transformation:
Decoder generates the next value to be generated at time i based on the intermediate semantic representation C of the sequence X and the previously generated historical information y1, y2….yi-1: yi

1.2 Disadvantages

The Encoder-Decoder framework has an obvious shortcoming.
- Encoder will encode the input sequence X into a fixed-length latent vector (semantic encoding c)
  - 1. The size of the latent vector is limited and cannot represent information-rich sequences;
  - 2. Due to the characteristics of RNN-type networks, the network will pay more attention to the information behind the sequence and cannot grasp the overall situation.

2 attentioned Seq2Seq

where: the semantic encoding ci of each element:

hj is the hidden state of each element of the encoder, αij is the weighting coefficient

4

st-1 is the output of decoder t-1 position

Guess you like

Origin blog.csdn.net/qq_40206371/article/details/133183838

Machine learning notes: seq2seq & attentioned seq2seq

Seq2Seq + Attention Detailed Explanation (Based on NEURAL MACHINE TRANSLATION BY JOINTLY LEARNING TO ALIGN AND TRANSLATE)

Encoder-Decoder Architecture, Seq2Seq Brief Notes

NLP learning record 5 - seq2seq model

NLP learning (5) ---- seq2seq / transformer

seq2seq function

14 days depth hands-on science learning task2 "hands-on learning": attentional mechanisms and Seq2seq model notes

Introduction to Deep Learning (65) Recurrent Neural Network - Sequence to Sequence Learning (seq2seq)

Task04: machine translation and related technology; attention mechanisms and Seq2seq model; Transformer

Machine translation and related technologies, and Seq2seq attention mechanism model, Transformer

Natural language processing-machine translation, Seq2seq, Attention

Seq2Seq PyTorch translation model based on detailed notes presentation (a)

[Artificial Intelligence] Section notes: Based Keras of seq2seq bot

Study Notes CB013: TensorFlow, TensorBoard, seq2seq

Pytorch learning record - training GRU Seq2Seq (read the paper)

Examples Pytorch learning record -torchtext and Pytorch (using neural network training Seq2Seq Code)

Implementation of seq2seq (2)

seq2seq (1) - EncoderDecoder architecture

seq2seq keras achieve

seq2seq + attention Interpretation

26 seq2seq model

Seq2Seq model and attentional mechanisms

Seq2Seq installation and problem solving

The realization of seq2seq (3)

seq2seq implements digital addition

Implementation of seq2seq (1)

Seq2Seq - - attention mechanism

The principle and implementation of seq2seq model

Artificial Intelligence-Generative Model-Seq2Seq: Seq2Seq model optimization program

seq2seq entirely based on neural network convolution

Recommended

Ranking

Linux关机和重启详解（shutdown、halt、poweroff、reboot、init）

Netty work notes 0007---NIO's three core component relationships

Knife4j tutorial

2021.10.29，内容:什么时候用接口和抽象类

How to solve the problem that changing the memory frequency causes the computer to become unusable?

SpringMVC Tutorial - Controller

linux learning skills -Linux 25 transport Vega paid special privileges and facl extension

Financial quarterly report evaluation report data automatic generation 1.0

Agile Development Series - The Values of Agile Development

scrapy achieve browsercookie Middleware

Daily

More

2024-05-19(0)

2024-05-18(31)

2024-05-17(6)

2024-05-16(23)

2024-05-15(5)

2024-05-14(9)

2024-05-13(8)

2024-05-12(28)

2024-05-11(32)

2024-05-10(34)