Intrinsically Motivated Decision Making for Situated, Goal-Driven Agents

作者: Mohamed Oubbati , Christian Fischer , Günther Palm

DOI: 10.1007/978-3-319-08864-8_16

关键词:

摘要: Goal-driven agents are generally expected to be capable of pursuing simultaneously a variety goals. As these goals may compete in certain circumstances, the agent must able constantly trade them off and shift their priorities rational way. One aspect rationality is evaluate its needs make decisions accordingly. We endow with set needs, or drives, that change over time as function external stimuli internal consumption, decision making process hast generate actions maintain balance between needs. The proposed framework pursues an approach which considered multiobjective problem approximately solved using hierarchical reinforcement learning architecture. At higher-level, Q-learning learns select best strategy improves well-being agent. lower-level, actor-critic design executes selected while interacting continuous, partially observable environment. provide simulation results demonstrate efficiency approach.

参考文章(21)
Ashwin Ram, David B Leake, None, Goal-driven learning MIT Press. ,(1995)
Stéphane Doncieux, Benoît Girard, Agnès Guillot, Jean-Baptiste Mouret, John Hallam, Jean-Arcady Meyer, From Animals to Animats 10 ,(2008)
Célia da Costa Pereira, Andrea G. B. Tettamanzi, A possibilistic approach to goal generation in cognitive agents international conference industrial engineering other applications applied intelligent systems. ,vol. 6097, pp. 397- 406 ,(2010) , 10.1007/978-3-642-13025-0_42
George Konidaris, Andrew Barto, An Adaptive Robot Motivational System From Animals to Animats 9. ,vol. 4095, pp. 346- 356 ,(2006) , 10.1007/11840541_29
Ulit Jaidee, David W. Aha, Héctor Muñoz-Avila, Integrated learning for goal-driven autonomy international joint conference on artificial intelligence. pp. 2450- 2455 ,(2011) , 10.5591/978-1-57735-516-8/IJCAI11-408
Peter Dayan, Goal-directed control and its antipodes Neural Networks. ,vol. 22, pp. 213- 219 ,(2009) , 10.1016/J.NEUNET.2009.03.004
Mohamed Oubbati, Bahram Kord, Petia Koprinkova-Hristova, Günther Palm, Learning of embodied interaction dynamics with recurrent neural networks: some exploratory experiments Journal of Neural Engineering. ,vol. 11, pp. 026019- ,(2014) , 10.1088/1741-2560/11/2/026019
M. A. Salichs, M. Malfaz, A New Approach to Modeling Emotions and Their Use on a Decision-Making System for Artificial Agents IEEE Transactions on Affective Computing. ,vol. 3, pp. 56- 68 ,(2012) , 10.1109/T-AFFC.2011.32
Dongkyu Choi, Reactive goal management in a cognitive architecture Cognitive Systems Research. ,vol. 12, pp. 293- 308 ,(2011) , 10.1016/J.COGSYS.2010.09.002