搜索历史记录选项已关闭,请开启搜索历史记录选项。
作者: Marek Petrik , Scott Niekum , Daniel S. Brown
DOI:
关键词:
摘要: One of the main challenges in imitation learning is determining what action an agent should take when outside the state distribution of the demonstrations. Inverse reinforcement …