Combining model-based design and model-free policy optimization to learn safe, stabilizing controllers

Authors: Tyler Westenbroek, Ayush Agrawal, Fernando Castañeda, S. Shankar Sastry, Koushil Sreenath

Abstract: This paper introduces a framework for learning a safe, stabilizing controller for a system with unknown dynamics using model-free policy optimization algorithms. Using a nominal dynamics model, the user specifies a candidate Control Lyapunov Function (CLF) around the desired operating point and specifies the desired safe set using a Control Barrier Function (CBF). Using penalty methods from the optimization literature, we then develop a family of policy optimization problems that attempt to minimize control effort while satisfying the pointwise constraints used to specify the CLF and CBF. We demonstrate that when the penalty terms are scaled correctly, the optimization prioritizes the maintenance of safety over stability, and stability over optimality. We discuss how standard reinforcement learning algorithms can be applied to the problem, and validate the approach through simulation. We then illustrate how the …
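
The penalty idea in the abstract can be pictured with a minimal sketch of a per-step cost. It is an illustration under stated assumptions, not the paper's formulation: the abstract does not give the exact constraints or penalty terms, so the code assumes the common pointwise conditions `V_dot + alpha * V <= 0` for the CLF and `h_dot + gamma * h >= 0` for the CBF, plus hinge penalties. All function names, parameters, and weights (`alpha`, `gamma`, `lam_clf`, `lam_cbf`) are hypothetical.

```python
def clf_residual(V, V_dot, alpha=1.0):
    """Positive when the assumed CLF decrease condition V_dot <= -alpha * V is violated."""
    return V_dot + alpha * V


def cbf_residual(h, h_dot, gamma=1.0):
    """Positive when the assumed CBF condition h_dot >= -gamma * h is violated."""
    return -(h_dot + gamma * h)


def penalized_cost(u, clf_res, cbf_res, lam_clf=10.0, lam_cbf=1000.0):
    """Control effort plus hinge penalties on constraint violations.

    Choosing lam_cbf much larger than lam_clf is one (assumed) way to make
    safety violations dominate stability violations, which in turn dominate
    control effort.
    """
    effort = sum(ui * ui for ui in u)          # squared norm of the input
    clf_penalty = lam_clf * max(0.0, clf_res)  # zero when the CLF condition holds
    cbf_penalty = lam_cbf * max(0.0, cbf_res)  # zero when the CBF condition holds
    return effort + clf_penalty + cbf_penalty


if __name__ == "__main__":
    u = [1.0, 2.0]
    # Both constraints satisfied: the cost reduces to control effort.
    print(penalized_cost(u, clf_residual(1.0, -2.0), cbf_residual(1.0, 0.0)))
    # Same-size violations: the CBF violation costs far more than the CLF one.
    print(penalized_cost(u, 0.5, 0.0), penalized_cost(u, 0.0, 0.5))
```

A model-free policy optimization method would then minimize the expected sum of such a cost along sampled trajectories. How the residuals are estimated for a system with unknown dynamics is not covered by this sketch.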
