Reinforcement learning with perceptual aliasing: the perceptual distinctions approach

作者: Lonnie Chrisman

DOI:

关键词:

摘要: It is known that Perceptual Aliasing may significantly diminish the effectiveness of reinforcement learning algorithms [Whitehead and Ballard, 1991]. aliasing occurs when multiple situations are indistinguishable from immediate perceptual input require different responses system. For example, if a robot can only see forward, yet presence battery charger behind it determines whether or not should backup, perception alone insufficient for determining most appropriate action. problematic since typically learn control policy to optimal choice action. This paper introduces predictive distinctions approach compensate caused incomplete world. An additional component, model, utilized track aspects world be visible at all times. In addition policy, model must also learned, allow stochastic actions noisy perception, probabilistic learned experience. process, system discover, on its own, important in Experimental results given simple simulated domain, issues discussed.

参考文章(16)
Long-Ji Lin, Programming robots using reinforcement learning and teaching national conference on artificial intelligence. pp. 781- 786 ,(1991)
Ming Tan, Cost-sensitive reinforcement learning for adaptive classification and control national conference on artificial intelligence. pp. 774- 780 ,(1991)
Steven D. Whitehead, A complexity analysis of cooperative mechanisms in reinforcement learning national conference on artificial intelligence. pp. 607- 613 ,(1991)
Sridhar Mahadevan, Jonathan Connell, Automatic programming of behavior-based robots using reinforcement learning national conference on artificial intelligence. pp. 768- 773 ,(1991)
Sebastian B. Thrun, Efficient Exploration In Reinforcement Learning Carnegie Mellon University. ,(1992)
José del R. Millan, Carme Torras, Learning to Avoid Obstacles through Reinforcement Machine Learning Proceedings 1991. pp. 298- 302 ,(1991) , 10.1016/B978-1-55860-200-7.50062-3
Lambert E. Wixson, Scaling Reinforcement Learning Techniques via Modularity Machine Learning Proceedings 1991. pp. 368- 372 ,(1991) , 10.1016/B978-1-55860-200-7.50076-3
Leslie Pack Kaelbling, David Chapman, Input generalization in delayed reinforcement learning: an algorithm and performance comparisons international joint conference on artificial intelligence. pp. 726- 731 ,(1991)