Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching

作者: Long-Ji Lin

DOI: 10.1007/BF00992699

关键词:

摘要: To date, reinforcement learning has mostly been studied solving simple learning tasks. Reinforcement learning methods that have been studied so far typically converge slowly. The purpose of this work is thus two-fold: 1) to investigate the utility of reinforcement learning in solving much more complicated learning tasks than previously studied, and 2) to investigate methods that will speed up reinforcement learning. This paper compares eight reinforcement learning frameworks: adaptive heuristic critic (AHC) learning due to Sutton, Q …

参考文章(31)
Lawrence A. Birnbaum, Gregg C. Collins, Proceedings of the eighth international workshop on Machine learning ,(1991)
Long-Ji Lin, Programming robots using reinforcement learning and teaching national conference on artificial intelligence. pp. 781- 786 ,(1991)
Long-Ji Lin, Self-improvement Based On Reinforcement Learning, Planning and Teaching Machine Learning Proceedings 1991. pp. 323- 327 ,(1991) , 10.1016/B978-1-55860-200-7.50067-2
Long-Ji Lin, Self-improving reactive agents: case studies of reinforcement learning frameworks simulation of adaptive behavior. pp. 297- 305 ,(1991)
Steven D. Whitehead, Dana H. Ballard, A role for anticipation in reactive systems that learn international conference on machine learning. pp. 354- 357 ,(1989) , 10.1016/B978-1-55860-036-2.50090-4
Kevin J. Lang, A time delay neural network architecture for speech recognition Carnegie Mellon University. ,(1989)
Charles W. Anderson, Strategy Learning with Multilayer Connectionist Representations Proceedings of the Fourth International Workshop on MACHINE LEARNING#R##N#June 22–25, 1987 University of California, Irvine. pp. 103- 114 ,(1987) , 10.1016/B978-0-934613-41-5.50014-3