Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching

Author: Long-Ji Lin

Abstract: To date, reinforcement learning has mostly been studied solving simple learning tasks. Reinforcement learning methods that have been studied so far typically converge slowly. The purpose of this work is thus two-fold: 1) to investigate the utility of reinforcement learning in solving much more complicated learning tasks than previously studied, and 2) to investigate methods that will speed up reinforcement learning. This paper compares eight reinforcement learning frameworks: adaptive heuristic critic (AHC) learning due to Sutton, Q …

References (31)

Lawrence A. Birnbaum, Gregg C. Collins, Proceedings of the Eighth International Workshop on Machine Learning, (1991)

Long-Ji Lin, Programming robots using reinforcement learning and teaching. National Conference on Artificial Intelligence. pp. 781-786, (1991)

Andrew W. Moore, Variable Resolution Dynamic Programming: Efficiently Learning Action Maps in Multivariate Real-valued State-spaces. Machine Learning Proceedings 1991. pp. 333-337, (1991), 10.1016/B978-1-55860-200-7.50069-6

Richard S. Sutton, Integrated architecture for learning, planning, and reacting based on approximating dynamic programming. International Conference on Machine Learning. pp. 216-224, (1990), 10.1016/B978-1-55860-141-3.50030-4

Long-Ji Lin, Self-improvement Based On Reinforcement Learning, Planning and Teaching. Machine Learning Proceedings 1991. pp. 323-327, (1991), 10.1016/B978-1-55860-200-7.50067-2

Long-Ji Lin, Self-improving reactive agents: case studies of reinforcement learning frameworks. Simulation of Adaptive Behavior. pp. 297-305, (1991)

Steven D. Whitehead, Dana H. Ballard, A role for anticipation in reactive systems that learn. International Conference on Machine Learning. pp. 354-357, (1989), 10.1016/B978-1-55860-036-2.50090-4

Kevin J. Lang, A time delay neural network architecture for speech recognition. Carnegie Mellon University, (1989)

Charles W. Anderson, Strategy Learning with Multilayer Connectionist Representations. Proceedings of the Fourth International Workshop on Machine Learning, June 22–25, 1987, University of California, Irvine. pp. 103-114, (1987), 10.1016/B978-0-934613-41-5.50014-3

Ming Tan, Learning a Cost-Sensitive Internal Representation for Reinforcement Learning. Machine Learning Proceedings 1991. pp. 358-362, (1991), 10.1016/B978-1-55860-200-7.50074-X
