搜索历史记录选项已关闭,请开启搜索历史记录选项。
作者: Peter J. Ramadge , Ting-Han Fan
DOI:
关键词:
摘要: Despite its experimental success, Model-based Reinforcement Learning still lacks a complete theoretical understanding. To this end, we analyze the error in the cumulative …