Problem Description
A PPO agent is trained on the gym.make('CartPole-v0') environment.
Parameters are as follows:
hidden_units = 50
layers = 3
learning_rate = 0.001 # critic and actor share the same learning rate
max_train_episodes = int(1e4)
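As a hypothetical sketch (the original network code is not shown), these parameters imply actor and critic MLPs of 3 hidden layers with 50 units each; the layer shapes can be derived as follows. CartPole-v0 has a 4-dimensional observation and 2 discrete actions.

```python
# Sketch of the network dimensions implied by the parameters above.
# HIDDEN_UNITS, LAYERS and the helper below are assumptions for illustration.
HIDDEN_UNITS = 50
LAYERS = 3
LEARNING_RATE = 1e-3   # shared by actor and critic

def mlp_shapes(in_dim, out_dim, hidden=HIDDEN_UNITS, layers=LAYERS):
    """Weight-matrix shapes for an MLP with `layers` hidden layers."""
    dims = [in_dim] + [hidden] * layers + [out_dim]
    return list(zip(dims[:-1], dims[1:]))

# CartPole-v0: 4-dim observation; 2 discrete actions
actor_shapes = mlp_shapes(4, 2)    # policy head: action logits
critic_shapes = mlp_shapes(4, 1)   # value head: scalar V(s)
print(actor_shapes)   # → [(4, 50), (50, 50), (50, 50), (50, 2)]
print(critic_shapes)  # → [(4, 50), (50, 50), (50, 50), (50, 1)]
```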
During training the agent steadily improves: the average episode reward climbs by about 50 steps. However, the critic loss and actor loss curves in TensorBoard show no downward trend.
Cause Analysis
As training progresses, the data in the buffer keeps changing: each batch is collected under the current, constantly updating policy. The actor and critic are therefore trained on a non-stationary dataset. This is unlike supervised learning on a fixed dataset, so the losses need not show a downward trend even though the policy is improving.
Reference:
https://stackoverflow.com/questions/47036246/dqn-q-loss-not-converging