Error bound analysis of policy iteration based approximate dynamic programming for deterministic discrete-time nonlinear systems

作者: Wentao Guo , Feng Liu , Jennie Si , Shengwei Mei , Rui Li

DOI: 10.1109/IJCNN.2015.7280783

关键词:

摘要: Extensive approximate dynamic programming (ADP) algorithms have been developed based on policy iteration. For iteration ADP of deterministic discrete-time nonlinear systems, existing literature has proved its convergence in the formulation undiscounted value function under assumption exact approximation. Furthermore, error bound analyzed a discounted with consideration approximation errors. However, there not any analysis In this paper, we intend to fill theoretical gap. We provide sufficient condition error, so that iterative can be bounded neighbourhood optimal function. To best authors' knowledge, is first result for systems considering

参考文章(41)
John N. Tsitsiklis, Dimitri P. Bertsekas, Neuro-dynamic programming ,(1996)
Dimitri P. Bertsekas, Abstract Dynamic Programming ,(2013)
S.J. Bradtke, B.E. Ydstie, A.G. Barto, Adaptive linear quadratic control using policy iteration advances in computing and communications. ,vol. 3, pp. 3475- 3479 ,(1994) , 10.1109/ACC.1994.735224
Jennie Si, Andrew G Barto, Warren B Powell, Don Wunsch, Handbook of Learning and Approximate Dynamic Programming (2004). ,(2004) , 10.1109/9780470544785
Xianchao Sui, Yufei Tang, Haibo He, Jinyu Wen, Energy-Storage-Based Low-Frequency Oscillation Damping Control Using Particle Swarm Optimization and Heuristic Dynamic Programming IEEE Transactions on Power Systems. ,vol. 29, pp. 2539- 2548 ,(2014) , 10.1109/TPWRS.2014.2305977
Derong Liu, Qinglai Wei, Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems IEEE Transactions on Systems, Man, and Cybernetics. ,vol. 43, pp. 779- 789 ,(2013) , 10.1109/TSMCB.2012.2216523
Russell Enns, Jennie Si, Helicopter Flight-Control Reconfiguration for Main Rotor Actuator Failures Journal of Guidance, Control, and Dynamics. ,vol. 26, pp. 572- 584 ,(2003) , 10.2514/2.5107
Qinglai Wei, Derong Liu, Adaptive Dynamic Programming for Optimal Tracking Control of Unknown Nonlinear Systems With Application to Coal Gasification IEEE Transactions on Automation Science and Engineering. ,vol. 11, pp. 1020- 1036 ,(2014) , 10.1109/TASE.2013.2284545