作者: Michael Herman , Tobias Gindele , Jörg Wagner , Felix Schmitt , Wolfram Burgard
DOI:
关键词:
摘要: This document contains supplementary material to the paper Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics with more detailed derivations, additional proofs to lemmata and theorems as well as larger illustrations and plots of the evaluation task.