Random Thoughts on Deep Reinforcement Learning

  • About model-based and model-free methods

    Model-free methods cannot be the future of reinforcement learning, even though these algorithms currently perform better than model-based methods. Their fatal flaw is the lack of interpretability: we cannot trust a policy without knowing why it takes a specific action, especially since it sometimes takes actions that look stupid and obviously wrong to us. Model-based methods relieve this concern to some extent, because the model gives us some knowledge about future states and outcomes. However, the model usually has to be learned, and a learned model can never be as accurate as the real environment. One way to cope with this is planning, especially tree search methods such as Monte Carlo Tree Search (MCTS). Tree search can reduce the variance introduced by the learned model by bootstrapping at each node, much like TD methods do, and it also gives us better interpretability, which is critical. A sketch of this bootstrapping idea follows below.
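
    Below is a minimal, self-contained sketch of MCTS over a learned model, just to make the bootstrapping point concrete. The `model(state, action) -> (next_state, reward)` interface, the learned `value_fn`, and all names here are hypothetical stand-ins, not any particular library's API. The key line is the leaf evaluation: instead of rolling out to termination, the learned value function stands in for the tail of the return, which is the TD-style variance reduction mentioned above.

    import math
    import random

    class Node:
        """One search-tree node: a visit count plus a running mean of backed-up returns."""
        def __init__(self, state):
            self.state = state
            self.children = {}   # action -> child Node
            self.visits = 0
            self.value = 0.0

    def ucb(parent, child, c):
        # UCB1: exploit the child's value estimate, explore rarely visited children.
        return child.value + c * math.sqrt(math.log(parent.visits + 1) / (child.visits + 1))

    def mcts(root_state, model, value_fn, actions, n_sims=200, c=1.4, gamma=0.99):
        """Plan with a hypothetical learned model and learned value function."""
        root = Node(root_state)
        for _ in range(n_sims):
            node, visited = root, [root]
            # Selection: descend while every action at this node has been tried.
            while len(node.children) == len(actions):
                node = max(node.children.values(), key=lambda ch: ucb(node, ch, c))
                visited.append(node)
            # Expansion: simulate one untried action with the learned model.
            action = random.choice([a for a in actions if a not in node.children])
            next_state, reward = model(node.state, action)
            child = Node(next_state)
            node.children[action] = child
            visited.append(child)
            # Bootstrapped evaluation: no rollout to termination; the learned
            # value function estimates the tail of the return, TD-style.
            ret = reward + gamma * value_fn(next_state)
            # Backup: fold the bootstrapped return into every node on the path.
            for n in reversed(visited):
                n.visits += 1
                n.value += (ret - n.value) / n.visits
        # Recommend the most-visited root action.
        return max(root.children, key=lambda a: root.children[a].visits)

    Acting from the most-visited root child also means the per-action visit counts and value estimates can be inspected after the search, which is the interpretability benefit argued for above.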

Reposted from www.cnblogs.com/initial-h/p/12208038.html