Reinforcement learning to adjust parametrized motor primitives to new situations

作者: Jens Kober , Andreas Wilhelm , Erhan Oztop , Jan Peters

DOI: 10.1007/S10514-012-9290-3

关键词: Contrast (statistics)Artificial intelligenceReinforcement learningComputer scienceGeneralizationThrowingLearning classifier systemMachine learningRobotSmall setTable (database)

摘要: Humans manage to adapt learned movements very quickly to new situations by generalizing learned behaviors from similar situations. In contrast, robots currently often need to re-learn the complete movement. In this paper, we propose a method that learns to generalize parametrized motor plans by adapting a small set of global parameters, called meta-parameters. We employ reinforcement learning to learn the required meta-parameters to deal with the current situation, described by states. We introduce an appropriate …

参考文章(47)
Craig A. Wrisberg, Richard A. Schmidt, Motor Learning and Performance ,(1991)
Stefan Schaal, Peyman Mohajerian, Auke Ijspeert, Dynamics systems vs. optimal control--a unifying view. Progress in Brain Research. ,vol. 165, pp. 425- 445 ,(2007) , 10.1016/S0079-6123(06)65027-9
Noah Cowan, Stuart Russell, Gregory Lawrence, Efficient gradient estimation for motor control learning uncertainty in artificial intelligence. pp. 354- 361 ,(2002)
Andrew G. Barto, Sridhar Mahadevan, Recent Advances in Hierarchical Reinforcement Learning Discrete Event Dynamic Systems. ,vol. 13, pp. 41- 77 ,(2003) , 10.1023/A:1022140919877
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)
Christopher K I Williams, Carl Edward Rasmussen, Gaussian Processes for Machine Learning ,(2005)
DARRIN C. BENTIVEGNA, CHRISTOPHER G. ATKESON, ALEŠ UDE, GORDON CHENG, LEARNING TO ACT FROM OBSERVATION AND PRACTICE International Journal of Humanoid Robotics. ,vol. 01, pp. 585- 611 ,(2004) , 10.1142/S0219843604000307
Wayco Scroggin, Stephen C. Jacobsen, Gordon Cheng, Sang-Ho Hyon, Jun Morimoto, Aleš Ude, Joshua G. Hale, Glenn Colvin, CB: A Humanoid Research Platform for Exploring NeuroScience ieee-ras international conference on humanoid robots. ,vol. 21, pp. 1097- 1114 ,(2006) , 10.1163/156855307781389356
Jens Kober, Jan Peters, Learning elementary movements jointly with a higher level task intelligent robots and systems. pp. 338- 343 ,(2011) , 10.1109/IROS.2011.6094834
Andrew P. Sage, L. N. Kanal, J. F. Lemmer, Uncertainty in Artificial Intelligence 2 ,(1988)