Learning of information gathering in modular intelligent systems

Author: Cem Karaoguz

Abstract: Research on intelligent systems aims to realize autonomous agents capable of performing various functions that ease the everyday life of humans. Usually, such occupations can be formalized as a collection of tasks that have to be executed in parallel or in sequence. Since real-world environments are highly dynamic and unpredictable, intelligent systems require cognitive capabilities to learn how to execute tasks through interactions. Considering that the system has limited resources for acquiring and processing information, a strategy is required to find and update task-relevant information sources efficiently in time. This thesis proposes a system-level approach to the information gathering process and an implementation that puts this idea into work. The presented framework takes a modular systems approach, where modules are defined as elementary units of information acquisition and processing. The modular design helps in handling scenario complexity. A module management mechanism learns which modules deliver task-relevant information and how constrained resources are distributed among these modules in a reward-based framework. This reduces the partial observability caused by constrained resources and provides better support for other high-level functionalities of the system. Such an adaptive approach also makes it possible to deal with variations in the environment. Two different applications in simulation are implemented to test the hypotheses and demonstrate the utility of the proposed framework: the first implements a 'reaching-while-interacting' scenario for a humanoid robot, and the second employs a navigation scenario for a mobile robot. Both scenarios involve dynamic objects, rendering a challenging environment close to real-world conditions. Results from the experiments provide evidence for the postulated thesis.
