A Contraction Approach to Model-based Reinforcement Learning.

作者: Peter J. Ramadge , Ting-Han Fan

DOI:

关键词:

摘要: Despite its experimental success, Model-based Reinforcement Learning still lacks a complete theoretical understanding. To this end, we analyze the error in the cumulative …

参考文章(31)
Cédric Villani, Optimal Transport: Old and New ,(2016)
Richard S. Sutton, Dyna, an integrated architecture for learning, planning, and reacting Intelligence\/sigart Bulletin. ,vol. 2, pp. 160- 163 ,(1991) , 10.1145/122344.122377
Umar Syed, Michael Bowling, Robert E. Schapire, Apprenticeship learning using linear programming Proceedings of the 25th international conference on Machine learning - ICML '08. pp. 1032- 1039 ,(2008) , 10.1145/1390156.1390286
Robert E Schapire, Umar Syed, A Reduction from Apprenticeship Learning to Classification neural information processing systems. ,vol. 23, pp. 2253- 2261 ,(2010)
Emanuel Todorov, Tom Erez, Yuval Tassa, MuJoCo: A physics engine for model-based control intelligent robots and systems. pp. 5026- 5033 ,(2012) , 10.1109/IROS.2012.6386109
Ilya Sutskever, Shixiang Gu, Sergey Levine, Timothy Lillicrap, Continuous deep Q-learning with model-based acceleration international conference on machine learning. pp. 2829- 2838 ,(2016)
Edwin Hewitt, Kenneth Allen Ross, Abstract Harmonic Analysis ,(1963)
The Contraction Mapping Theorem WORLD SCIENTIFIC. pp. 61- 79 ,(2001) , 10.1142/9789812810670_0003
Pieter Abbeel, Tamim Asfour, John Schulman, Ignasi Clavera, Jonas Rothfuss, Yasuhiro Fujita, Model-Based Reinforcement Learning via Meta-Policy Optimization Conference on Robot Learning. pp. 617- 629 ,(2018)
Pieter Abbeel, Jimmy Ba, Xuchan Bao, Guodong Zhang, Yeming Wen, Tingwu Wang, Jerrick Hoang, Shunshi Zhang, Eric Langlois, Ignasi Clavera, Benchmarking Model-Based Reinforcement Learning arXiv: Learning. ,(2019)