Batch Reinforcement Learning

Authors: Sascha Lange, Thomas Gabel, Martin Riedmiller

DOI: 10.1007/978-3-642-27645-3_2

Abstract: Batch reinforcement learning is a subfield of dynamic programming-based reinforcement learning. Originally defined as the task of learning the best possible policy from a fixed set of a priori-known transition samples, the (batch) algorithms developed in this field can be easily adapted to the classical online case, where the agent interacts with the environment while learning. Due to the efficient use of collected data and the stability of the learning process, this research area has attracted a lot of attention recently. In this chapter, we introduce the basic principles and the theory behind batch reinforcement learning, describe the most important algorithms, exemplarily discuss ongoing research within this field, and briefly survey real-world applications of batch reinforcement learning.

References (45)
Michail G. Lagoudakis, Ronald Parr. Model-Free Least-Squares Policy Iteration. Neural Information Processing Systems, vol. 14, pp. 1547-1554 (2001).
Geoffrey E. Hinton, Ruslan R. Salakhutdinov. Reducing the Dimensionality of Data with Neural Networks. Science, vol. 313, pp. 504-507 (2006). DOI: 10.1126/SCIENCE.1127647
Dirk Ormoneit, Peter W. Glynn. Kernel-Based Reinforcement Learning in Average-Cost Problems: An Application to Optimal Portfolio Choice. Neural Information Processing Systems, vol. 13, pp. 1068-1074 (2000).
Stephan Timmer, Martin Riedmiller. Fitted Q Iteration with CMACs. 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, pp. 1-8 (2007). DOI: 10.1109/ADPRL.2007.368162
Jan Peters, Sethu Vijayakumar, Stefan Schaal. Natural Actor-Critic. Machine Learning: ECML 2005, pp. 280-291 (2005). DOI: 10.1007/11564096_29
Martin Riedmiller, Roland Hafner, Sascha Lange, Martin Lauer. Learning to Dribble on a Real Robot by Success and Failure. International Conference on Robotics and Automation, pp. 2207-2208 (2008). DOI: 10.1109/ROBOT.2008.4543536
Louis Wehenkel, Pierre Geurts, Damien Ernst. Tree-Based Batch Mode Reinforcement Learning. Journal of Machine Learning Research, vol. 6, pp. 503-556 (2005).
A.G. Barto, R.S. Sutton. Reinforcement Learning: An Introduction (1998).
Tommi Jaakkola, Satinder P. Singh, Michael I. Jordan. Reinforcement Learning with Soft State Aggregation. Neural Information Processing Systems, vol. 7, pp. 361-368 (1994).
Martin Riedmiller, Thomas Gabel, Roland Hafner, Sascha Lange. Reinforcement Learning for Robot Soccer. Autonomous Robots, vol. 27, pp. 55-73 (2009). DOI: 10.1007/S10514-009-9120-4