Learning to Play with Intrinsically-Motivated Self-Aware Agents

作者: Li Fei-Fei , Daniel L. K. Yamins , Nick Haber , Damian Mrowca

DOI:

关键词:

摘要: Infants are experts at playing, with an amazing ability to generate novel structured behaviors in unstructured environments that lack clear extrinsic reward signals. We seek mathematically formalize these abilities using a neural network implements curiosity-driven intrinsic motivation. Using simple but ecologically naturalistic simulated environment which agent can move and interact objects it sees, we propose "world-model" learns predict the dynamic consequences of agent's actions. Simultaneously, train separate explicit "self-model" allows track error map its own world-model, then uses self-model adversarially challenge developing world-model. demonstrate this policy causes explore informative interactions environment, leading generation spectrum complex behaviors, including ego-motion prediction, object attention, gathering. Moreover, world-model supports improved performance on dynamics detection, localization recognition tasks. Taken together, our results initial steps toward creating flexible autonomous agents self-supervise physical environments.

参考文章(43)
Diederik P. Kingma, Jimmy Ba, Adam: A Method for Stochastic Optimization arXiv: Learning. ,(2014)
Long-Ji Lin, Reinforcement learning for robots using neural networks Carnegie Mellon University. ,(1992)
Adrien Baranes, Pierre-Yves Oudeyer, Active learning of inverse models with intrinsically motivated goal exploration in robots Robotics and Autonomous Systems. ,vol. 61, pp. 49- 73 ,(2013) , 10.1016/J.ROBOT.2012.05.008
Karinna B. Hurley, Lisa M. Oakes, Experience and distribution of attention: Pet exposure and infants' scanning of animal images. Journal of Cognition and Development. ,vol. 16, pp. 11- 30 ,(2015) , 10.1080/15248372.2013.833922
Jacqueline Gottlieb, Pierre-Yves Oudeyer, Manuel Lopes, Adrien Baranes, Information-seeking, curiosity, and attention: computational and neural mechanisms Trends in Cognitive Sciences. ,vol. 17, pp. 585- 593 ,(2013) , 10.1016/J.TICS.2013.09.001
Jürgen Schmidhuber, Formal Theory of Creativity, Fun, and Intrinsic Motivation (1990–2010) IEEE Transactions on Autonomous Mental Development. ,vol. 2, pp. 230- 247 ,(2010) , 10.1109/TAMD.2010.2056368
Pierre-Yves Oudeyer, Frdric Kaplan, Verena V. Hafner, Intrinsic Motivation Systems for Autonomous Mental Development IEEE Transactions on Evolutionary Computation. ,vol. 11, pp. 265- 286 ,(2007) , 10.1109/TEVC.2006.890271