Relational transfer in reinforcement learning

Authors: Lisa Torrey, Jude Shavlik

Abstract: Transfer learning is an inherent aspect of human learning. When humans learn to perform a task, we rarely start from scratch. Instead, we recall relevant knowledge from previous experiences and apply that knowledge to help us master the new task more quickly. This principle can be applied to machine learning as well. Machine learning often addresses single tasks in isolation. Even though multiple related tasks may exist in a domain, many algorithms for machine learning have no way to utilize those relationships. Algorithms that allow successful transfer from one task (the source) to another task (the target) are necessary steps towards making machine learning as adaptable as human learning. This thesis investigates transfer methods for reinforcement learning (RL), where an agent takes a series of actions in an environment. RL often requires substantial amounts of nearly random exploration, particularly in the early stages of learning. The ability to transfer knowledge from previous tasks can therefore be an important asset for RL agents. Knowledge from a source task can mitigate the low initial performance that is common in challenging target tasks. I focus on transferring relational knowledge that guides action choices. Relational knowledge typically uses first-order logic to express information about relationships among objects. First-order logic, unlike propositional logic, can use variables that generalize over classes of objects. This greater generalization makes first-order logic more effective for transfer. This thesis contributes six transfer algorithms in three categories: advice-based transfer, macro transfer, and MLN transfer. Advice-based transfer uses source-task knowledge to provide advice for a target-task learner, which can follow, refine, or ignore the advice according to its value. Macro-transfer and MLN-transfer methods use source-task experience to demonstrate good behavior for a target-task learner. I evaluate these algorithms experimentally in the complex reinforcement-learning domain of RoboCup simulated soccer. All of my algorithms provide empirical benefits compared to non-transfer approaches, either by increasing initial performance or by enabling faster learning in the target task.
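The abstract's contrast between propositional and first-order knowledge can be illustrated with a small sketch. This is not code from the thesis. The state layout, the feature name `nearest_opponent_dist`, the teammate names, and the 5.0 threshold are all hypothetical, chosen only to show why a rule with a variable carries over to a task with different objects while a rule naming one object may not.

```python
from typing import Dict, List

# Hypothetical state format: teammate name -> feature dictionary.
State = Dict[str, Dict[str, float]]

OPEN_THRESHOLD = 5.0  # assumed distance beyond which a teammate counts as open


def propositional_rule(state: State) -> List[str]:
    """Propositional-style rule tied to one named object:
    'pass to a1 if a1 is open'."""
    a1 = state.get("a1")
    if a1 is not None and a1["nearest_opponent_dist"] > OPEN_THRESHOLD:
        return ["pass(a1)"]
    return []


def relational_rule(state: State) -> List[str]:
    """First-order-style rule with a variable T:
    pass(T) :- teammate(T), nearest_opponent_dist(T) > threshold."""
    return [
        f"pass({teammate})"
        for teammate, features in sorted(state.items())
        if features["nearest_opponent_dist"] > OPEN_THRESHOLD
    ]


# Source task with two teammates, target task with three.
source_state: State = {
    "a1": {"nearest_opponent_dist": 8.0},
    "a2": {"nearest_opponent_dist": 2.0},
}
target_state: State = {
    "a1": {"nearest_opponent_dist": 3.0},
    "a2": {"nearest_opponent_dist": 6.5},
    "a3": {"nearest_opponent_dist": 9.0},
}

print(propositional_rule(source_state))  # rule fires for a1 in the source task
print(propositional_rule(target_state))  # a1 is covered, so nothing is suggested
print(relational_rule(target_state))     # variable T binds to a2 and a3
```

In this toy setting the propositional rule suggests nothing in the target state because it can only talk about `a1`, whereas the relational rule binds its variable to whichever teammates satisfy the condition, including `a3`, which did not exist in the source task.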
