Global optimality of approximate dynamic programming and its use in non-convex function minimization

Authors: Ali Heydari, S.N. Balakrishnan

DOI: 10.1016/J.ASOC.2014.07.003


Abstract: This study investigates the global optimality of approximate dynamic programming (ADP) based solutions using neural networks for optimal control problems with fixed final time. Issues including whether or not the cost function terms and the system dynamics need to be convex functions with respect to their inputs are discussed, and sufficient conditions for global optimality of the result are derived. Next, a new idea is presented to use ADP for optimization of non-convex smooth functions. It is shown that any initial guess leads to direct movement toward the proximity of the global optimum of the function. This behavior is in contrast with gradient based optimization methods, in which the movement is guided by the shape of the local level curves. Illustrative examples are provided with single and multi-variable functions that demonstrate the potential of the proposed method.

Figure caption: Level curves of the Rosenbrock function subject to minimization and state trajectories for different initial conditions x0 ∈ {-2, -1, 0, 1, 2} × {-2, 2}. The red plus signs denote the initial point of the respective trajectory.
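The gradient-method behavior that the abstract contrasts against can be sketched on the Rosenbrock function named in the figure caption. The block below is not the paper's ADP method; it is only a minimal fixed-step gradient descent baseline, assuming the standard Rosenbrock parameters a = 1 and b = 100. The step size and iteration count are illustrative guesses, and the function names are hypothetical.

```python
from itertools import product


def rosenbrock(x, y, a=1.0, b=100.0):
    """Two-variable Rosenbrock function; its global minimum is 0 at (a, a**2)."""
    return (a - x) ** 2 + b * (y - x ** 2) ** 2


def rosenbrock_grad(x, y, a=1.0, b=100.0):
    """Analytic gradient (df/dx, df/dy) of the Rosenbrock function."""
    dfdx = -2.0 * (a - x) - 4.0 * b * x * (y - x ** 2)
    dfdy = 2.0 * b * (y - x ** 2)
    return dfdx, dfdy


def gradient_descent(x, y, step=1e-4, iters=5000):
    """Fixed-step gradient descent; step and iters are illustrative guesses."""
    for _ in range(iters):
        gx, gy = rosenbrock_grad(x, y)
        x, y = x - step * gx, y - step * gy
    return x, y


# Initial conditions from the figure caption: {-2, -1, 0, 1, 2} x {-2, 2}.
initial_conditions = list(product([-2.0, -1.0, 0.0, 1.0, 2.0], [-2.0, 2.0]))

# Map each starting point to (cost at start, cost after descent).
results = {}
for x0, y0 in initial_conditions:
    xf, yf = gradient_descent(x0, y0)
    results[(x0, y0)] = (rosenbrock(x0, y0), rosenbrock(xf, yf))

for (x0, y0), (f_start, f_end) in results.items():
    print(f"x0=({x0:+.0f}, {y0:+.0f})  f: {f_start:9.2f} -> {f_end:.4f}")
```

Each update follows the local gradient only, so the path is shaped by the level curves near the current point. That is the behavior the abstract sets against the direct movement attributed to the ADP-based approach.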


