Model-Based Reinforcement Learning via Meta-Policy Optimization

Authors: Pieter Abbeel, Tamim Asfour, John Schulman, Ignasi Clavera, Jonas Rothfuss

DOI:

Keywords:

Abstract: … the asymptotic performance of model-free methods while … model-based RL and approaches that combine elements of … the real environment, and fine-tuning the policy with VPG on the …
