2009 Special Issue: Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence

Authors: Travis Dierks, Balaje T. Thumati, S. Jagannathan

DOI: 10.1016/J.NEUNET.2009.06.014

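Abstract: The optimal control of linear systems accompanied by quadratic cost functions can be achieved by solving the well-known Riccati equation. However, the optimal control of nonlinear discrete-time systems is a much more challenging task that often requires solving the nonlinear Hamilton-Jacobi-Bellman (HJB) equation. In the recent literature, discrete-time approximate dynamic programming (ADP) techniques have been widely used to determine the optimal or near optimal control policies for affine nonlinear discrete-time systems. However, an inherent assumption of ADP requires the value of the controlled system one step ahead and at least partial knowledge of the system dynamics to be known. In this work, the need for partial knowledge of the nonlinear system dynamics is relaxed in the development of a novel approach to ADP using a two-part process: online system identification and offline optimal control training. First, in the system identification process, a neural network (NN) is tuned online using novel tuning laws to learn the complete plant dynamics so that local asymptotic stability of the identification error can be shown. Then, using only the learned NN system model, offline ADP is attempted, resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics, as only the learned NN model is needed. The proof of convergence is demonstrated. Simulation results verify the theoretical conjecture.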
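
As a point of reference for the linear-quadratic case mentioned in the opening sentence of the abstract, the following is a minimal sketch, not taken from the paper, of how a scalar discrete-time Riccati equation can be solved by fixed-point iteration. The system x_{k+1} = a*x_k + b*u_k, the cost sum of q*x_k^2 + r*u_k^2, the function name `riccati_value_iteration`, and all numeric values are illustrative assumptions. The nonlinear, NN-based scheme described in the abstract is not reproduced here.

```python
def riccati_value_iteration(a, b, q, r, tol=1e-12, max_iter=10_000):
    """Iterate the scalar discrete-time Riccati recursion starting from p = 0.

    Hypothetical helper for illustration only. Assumes the scalar system
    x_{k+1} = a*x_k + b*u_k with stage cost q*x_k**2 + r*u_k**2, q >= 0, r > 0.
    """
    p = 0.0
    for _ in range(max_iter):
        # One Riccati update: cost-to-go weight after one more backup step.
        p_next = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
        converged = abs(p_next - p) < tol
        p = p_next
        if converged:
            break
    # Feedback gain for the control law u_k = -gain * x_k.
    gain = a * b * p / (r + b * b * p)
    return p, gain


# Illustrative values (assumed, not from the paper).
p, gain = riccati_value_iteration(a=1.0, b=1.0, q=1.0, r=1.0)
print(f"p = {p:.6f}, gain = {gain:.6f}")
```

For these assumed values the fixed point satisfies p^2 - p - 1 = 0, so the iteration settles at the golden ratio and the closed-loop pole a - b*gain lies inside the unit circle.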
