Learning to coordinate visual behaviors

Authors: Dana Ballard, Nathan Sprague

Abstract: This dissertation explores the problem of visually guided control. The focus is not on the details of image processing, but on understanding the role that vision plays within the context of an active agent. More specifically, we explore the problem of managing vision in multiple-goal tasks. When multiple tasks are addressed simultaneously, conflicts arise because of limitations in sensor and effector availability and in computational capacity. This dissertation describes principled ways of handling those conflicts using a decision-theoretic approach. The test bed for this work is a graphical human that processes a rendered video stream in order to navigate through a realistically modeled urban environment. The aim is to understand behavior both as it relates to the engineering of embodied mobile agents and as it relates to the science of vision. We demonstrate the approach on the virtual agent, and also present experimental results illustrating that the same framework can effectively model eye movement scheduling.
