Global optimality of approximate dynamic programming and its use in non-convex function minimization

作者: Ali Heydari , S.N. Balakrishnan

DOI: 10.1016/J.ASOC.2014.07.003

关键词:

摘要: Level curves of the Rosenbrock function subject to minimization and state trajectories for different initial conditions x0ź{-2, -1, 0, 1, 2}×{-2, 2}. The red plus signs denote point respective trajectory. This study investigates global optimality approximate dynamic programming (ADP) based solutions using neural networks optimal control problems with fixed final time. Issues including whether or not cost terms system dynamics need be convex functions respect their inputs are discussed sufficient result derived. Next, a new idea is presented use ADP optimization non-convex smooth functions. It shown that any guess leads direct movement toward proximity optimum function. behavior in contrast gradient methods which guided by shape local level curves. Illustrative examples provided single multi-variable demonstrate potential proposed method.

参考文章(33)
William F. Trench, Introduction to Real Analysis ,(1982)
Michael Stone, Paul Goldbart, Mathematics for Physics Cambridge University Press. ,(2009) , 10.1017/CBO9780511627040
Chi-Tsong Chen, Linear System Theory and Design ,(1995)
Donald E. Kirk, Optimal control theory : an introduction Dover Publications. ,(1970)
Cecile DeWitt‐Morette, R. Abraham, J. E. Marsden, T. Ratiu, Manifolds, tensor analysis, and applications ,(1983)
Ruizhuo Song, Huaguang Zhang, The finite-horizon optimal control for a class of time-delay affine nonlinear system Neural Computing and Applications. ,vol. 22, pp. 229- 235 ,(2013) , 10.1007/S00521-011-0706-3
S. N. Balakrishnan, Victor Biega, Adaptive-critic based neural networks for aircraft optimal control Journal of Guidance Control and Dynamics. ,vol. 19, pp. 893- 898 ,(1996) , 10.2514/3.21715