[Learning] Reptile, gradient descent LSTM, Siamese Network, prototype network, matching network, relational network


Reptile

insert image description here
insert image description here
insert image description here

RNN

insert image description here

LSTM

Variation of RNN
insert image description here
insert image description here
Adding Gates
insert image description here
insert image description here

LSTM with Gradient Descent

insert image description here
GD looks like a simplified LSTM
insert image description here
that allows the machine to automatically learn these zf and zi.
insert image description here
The c and x of the typical LSTM have nothing to do with each other, but the θ of GD's LSTM is related to the gradient of the input.
insert image description here
insert image description here
Advantages: reasonable model size; in typical gradient descent, all parameters use the same update rule; training and testing model architectures can be different.
insert image description here
Parameter updates depend not only on the current gradient but also on previous gradients.
insert image description here
insert image description here
Join another LSTM, m stores the previous data
insert image description here

metrics-based approach

insert image description here
insert image description here
Learn a network that can simultaneously input training data and test data and output results.

insert image description here

insert image description here

Siamese Network

insert image description here
Siamese network-intuitive explanation: Binary classification problem: "Are they the same?"
insert image description here
insert image description here
The data obtained after the picture passes through cnn, if it is the same face, it will be very similar, and the difference will be far away.
auto-encoder will keep all the data of the picture.
What distance should we use?
insert image description here

N-way Few/One-shot Learning

Suppose the task is a classification question rather than a yes-no question.
insert image description here

prototype network

insert image description here

matching network

The s layer gets the output after going through the multi-hop similar memory network
insert image description here

relationship network

insert image description here
Use another network to calculate the similarity.

Few-shot learning on virtual data

insert image description here
Using the generator, a picture is generated through G to generate multiple pictures, and then put into the network.
insert image description here

Train + Test as RNN

insert image description here

Input the one-hot vector of embedding and category, it cannot be trained
insert image description here

Guess you like

Origin blog.csdn.net/Raphael9900/article/details/128644127