Model-Based Reinforcement Learning via Meta-Policy Optimization

作者： Pieter Abbeel , Tamim Asfour , John Schulman , Ignasi Clavera , Jonas Rothfuss

DOI:

关键词:

摘要: … the asymptotic performance of model-free methods while … model-based RL and approaches that combine elements of … the real environment, and fine-tuning the policy with VPG on the …

washington.edu PDF 下载加速

参考文章(0)

Model-Based Reinforcement Learning via Meta-Policy Optimization

来源期刊

我的账户

Model-Based Reinforcement Learning via Meta-Policy Optimization

来源期刊

相似文章 0

我的账户