作者: Stefano Ermon , Yanan Sui , Jiaming Song , Yang Song , Kuno Kim
DOI:
关键词:
摘要: We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning …