【5分钟 Paper】Continuous Control With Deep Reinforcement Learning

  • Thesis title : Continuous Control With Deep Reinforcement Learning

Title and author information

The problem solved?

  This article will be Deep Q-Learningapplied to the Deterministic Policy Gradientalgorithm. If you know DPG, then this article is to introduce DQNimproved a bit DPGof state value function. It solves the limitation of the DQNneed to find maximizes action-valueonly the discrete action space.

background

  In fact, it is a combination of these two articles:

The method used?

  This DDPGI too familiar, I really do not want to write what the appendix it a pseudo-code:

DDPG algorithm

The effect achieved?

  The experimental results are shown below:

Insert picture description here

Information published? author information?

  This article is the one ICLR2016above. The first author TimothyP.Lillicrapis Google DeepMindof research Scientist.

  Research focuses on machine learning and statistics for optimal control and decision making, as well as using these mathematical frameworks to understand how the brain learns. In recent work, I’ve developed new algorithms and approaches for exploiting deep neural networks in the context of reinforcement learning, and new recurrent memory architectures for one-shot learning. Applications of this work include approaches for recognizing images from a single example, visual question answering, deep learning for robotics problems, and playing games such as Go and StarCraft. I’m also fascinated by the development of deep network models that might shed light on how robust feedback control laws are learned and employed by the central nervous system.

  • Personal homepage: http://contrastiveconvergence.net/~timothylillicrap/index.php

Author avatar

My micro-channel public number Name : deep learning and advanced intelligent decision-making
micro-channel public number ID : MultiAgent1024
Public Number Description : The main share deep learning, computer games, reinforcement learning, and other related content! Looking forward to your attention, welcome to learn and exchange progress together!

Published 185 original articles · praised 168 · 210,000 views

Guess you like

Origin blog.csdn.net/weixin_39059031/article/details/104612372