Reinforcement Learning: An Introduction, translation of Section 1.6

2020-10-26 13:41:09

1.6 Summary

Reinforcement learning is a computational approach to understanding and automating goal-directed learning and decision making. It differs from other computational approaches in its emphasis on an agent learning from direct interaction with its environment, without requiring exemplary supervision or a complete model of the environment. In our opinion, reinforcement learning is the first field to seriously address the computational issues that arise when learning from interaction with an environment in order to achieve long-term goals.

Reinforcement learning uses the formal framework of Markov decision processes to define the interaction between a learning agent and its environment in terms of states, actions, and rewards. This framework is intended to be a simple way of representing the essential features of the artificial intelligence problem. These features include a sense of cause and effect, a sense of uncertainty and nondeterminism, and the existence of explicit goals.
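The interaction in terms of states, actions, and rewards can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and does not come from the book: the five-state `Corridor` environment, the `run_episode` helper, and the fixed "always move right" policy are hypothetical, and the transitions are deterministic to keep the example short.

```python
class Corridor:
    """Hypothetical 5-state corridor; the episode ends at the rightmost state."""

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.state = 0

    def reset(self):
        # Every episode starts at the leftmost state.
        self.state = 0
        return self.state

    def step(self, action):
        # action is -1 (left) or +1 (right); the walls clamp the position.
        self.state = min(max(self.state + action, 0), self.n_states - 1)
        done = self.state == self.n_states - 1
        reward = 1.0 if done else 0.0  # reward only on reaching the goal
        return self.state, reward, done


def run_episode(env, policy, max_steps=100):
    """Run one episode and return the list of (state, action, reward) steps."""
    state = env.reset()
    trajectory = []
    for _ in range(max_steps):
        action = policy(state)                       # agent picks an action
        next_state, reward, done = env.step(action)  # environment responds
        trajectory.append((state, action, reward))
        state = next_state
        if done:
            break
    return trajectory


def always_right(state):
    """A fixed illustrative policy: move right in every state."""
    return +1


trajectory = run_episode(Corridor(), always_right)
```

Here the agent sees only the state and the reward returned by `step`; it is never handed a model of the corridor or a labelled correct action, which is the kind of interaction the framework is meant to capture.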

The concepts of value and value function are key to most of the reinforcement learning methods we consider in this book. We take the position that value functions are important for efficient search in the space of policies. The use of value functions distinguishes reinforcement learning methods from evolutionary methods, which search directly in policy space guided by evaluations of entire policies.
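To make the idea of a value function concrete, here is a small sketch that estimates state values with a one-step temporal-difference update, TD(0). This is one possible technique chosen for illustration, not something this section prescribes. The setup is assumed: a deterministic five-state corridor, a fixed "always move right" policy, a reward of 1 on reaching the last state, and hypothetical parameters `gamma` and `alpha`.

```python
def td0_values(n_states=5, gamma=0.9, alpha=0.5, episodes=200):
    """Estimate state values under an 'always move right' policy with TD(0)."""
    values = [0.0] * n_states  # the terminal state's value stays at 0
    terminal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != terminal:
            next_state = state + 1  # deterministic move to the right
            reward = 1.0 if next_state == terminal else 0.0
            # Move the estimate toward the one-step bootstrapped target.
            target = reward + gamma * values[next_state]
            values[state] += alpha * (target - values[state])
            state = next_state
    return values


values = td0_values()
```

With these assumed settings the estimates for states 0 through 3 approach 0.729, 0.81, 0.9, and 1.0, the discounted reward from each state. Each state receives its own estimate, updated after every step, whereas an evolutionary method would score a whole policy only by the total return of complete episodes.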


Origin blog.csdn.net/wangyifan123456zz/article/details/107381072
