Dry [summary] | Deep Reinforcement Learning depth of reinforcement learning

        In machine learning, we often classified as supervised learning and unsupervised learning, but try ignores an important branch of reinforcement learning. Supervised learning and unsupervised learning to distinguish very well, learning objectives, with or without labels are case standard. If the goal is to predict the supervised learning, reinforcement learning is decision making so that the surrounding environment by constantly updating status, given rewards or punishment measures to constantly adjust and give the new strategy. In short, like a child you should not eat snack time eating snacks, your mom know you will make the punishment, then the next time it will not make the same mistake, if you follow the rules, then perhaps your mother will give you some incentive, the ultimate goal is you want to eat at the dinner, when to eat snacks to eat snacks, rather than snacking at inappropriate times. Similarly, Flappy bird has swept over a period of time, a lot of players in a short time to reach the high score, is how to do it? In addition to very powerful players are really playing their own hand high, in fact, a lot of high scores through reinforcement learning method we use to train a model, learn how to let the birds do not fly straight ahead and encountered obstacles, the highest Minute. In addition, the well-known Alpha Go, actually strengthening the model of learning and training, but the depth of reinforcement learning.

 
  After 2013 DeepMind published a Playing Atari with Deep Reinforcement Learning article, the depth of reinforcement learning it slowly into people's vision. Later, in 2015, DeepMind has published a Human Level Control through Deep Reinforcement Learning, so that the depth of intensive study to get a lot of attention, then the emergence of many academic achievements. We are familiar with the depth of reinforcement learning should be in the 16 to 17 years time, especially after the Alpha Go occur, many companies and a large number of researchers have begun to focus on the depth of reinforcement learning, and try to apply it in various application scenarios.
 
  About the depth of reinforcement learning, I compiled some information, if interested can learn about:
 
【paper】
 
[Blog]
 
【article】
 
【course】
 
 
【学习网站】
 
【Github】
 
 
【会议】

Guess you like

Origin www.cnblogs.com/zhangchao162/p/11417563.html