Transfer in reinforcement learning via shared features

作者: Ilya Scheidwasser , George Konidaris , Andrew G. Barto

DOI: 10.5555/2188385.2343689

关键词:

摘要: We present a framework for transfer in reinforcement learning based on the idea that related tasks share some common features, and can be achieved via those shared features. The attempts to capture notion of are but distinct, provides insight into when usefully applied problem sequence it cannot. apply knowledge problem, show an agent learn portable shaping function from experience significantly improve performance later task, even given very brief training period. also skill transfer, agents skills across tasks, approaching perfectly learned problem-specific skills.

参考文章(56)
Andrew G. Barto, Richard S. Sutton, Oliver G. Selfridge, Training and tracking in robotics international joint conference on artificial intelligence. pp. 670- 672 ,(1985)
Balaraman Ravindran, Andrew G. Barto, SMDP homomorphisms: an algebraic approach to abstraction in semi-Markov decision processes international joint conference on artificial intelligence. pp. 1011- 1016 ,(2003)
D. S. Bernstein, Reusing Old Policies to Accelerate Learning on New MDPs TITLE2 University of Massachusetts. ,(1999)
D. Precup, T. J. Perkins, Using Options for Knowledge Transfer in Reinforcement Learning TITLE2 University of Massachusetts. ,(1999)
Philip E. Agre, David Chapman, Pengi: an implementation of a theory of activity national conference on artificial intelligence. pp. 268- 272 ,(1987)
Tom Croonenborghs, Kurt Driessens, Maurice Bruynooghe, Learning relational options for inductive transfer in relational reinforcement learning inductive logic programming. ,vol. 4894, pp. 88- 97 ,(2007) , 10.1007/978-3-540-78469-2_12
Doina Precup, Satinder P. Singh, Richard S. Sutton, Intra-Option Learning about Temporally Abstract Actions international conference on machine learning. pp. 556- 564 ,(1998)
Jette Randløv, Preben Alstrøm, Learning to Drive a Bicycle Using Reinforcement Learning and Shaping international conference on machine learning. pp. 463- 471 ,(1998)
Lisa Torrey, Jude Shavlik, Trevor Walker, Richard Maclin, Skill Acquisition Via Transfer Learning and Advice Taking Lecture Notes in Computer Science. pp. 425- 436 ,(2006) , 10.1007/11871842_41
Maja J. Matarić, Reinforcement learning in the multi-robot domain Autonomous Robots. ,vol. 4, pp. 73- 83 ,(1997) , 10.1023/A:1008819414322