Leveraging Human Guidance for Deep Reinforcement Learning Tasks

作者: Ruohan Zhang , Faraz Torabi , Lin Guan , Dana H. Ballard , Peter Stone

DOI: 10.24963/IJCAI.2019/884

关键词: Action (philosophy)Sequential decisionHuman–computer interactionImitation learningReinforcement learningHuman knowledgeComputer science

摘要: Reinforcement learning agents can learn to solve sequential decision tasks by interacting with the environment. Human knowledge of how these be incorporated using imitation learning, where agent learns imitate human demonstrated decisions. However, guidance is not limited demonstrations. Other types could more suitable for certain and require less effort. This survey provides a high-level overview five recent frameworks that primarily rely on other than conventional, step-by-step action We review motivation, assumption, implementation each framework. then discuss possible future research directions.

参考文章(0)