2009 Special Issue: Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence

Authors: Travis Dierks, Balaje T. Thumati, S. Jagannathan

DOI: 10.1016/J.NEUNET.2009.06.014

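Abstract: The optimal control of linear systems accompanied by quadratic cost functions can be achieved by solving the well-known Riccati equation. However, the optimal control of nonlinear discrete-time systems is a much more challenging task that often requires solving the nonlinear Hamilton-Jacobi-Bellman (HJB) equation. In recent literature, discrete-time approximate dynamic programming (ADP) techniques have been widely used to determine the optimal or near-optimal control policies for affine nonlinear discrete-time systems. However, an inherent assumption of ADP is that the value of the controlled system one step ahead and at least partial knowledge of the system dynamics are known. In this work, the need for partial knowledge of the nonlinear system dynamics is relaxed in the development of a novel approach to ADP using a two-part process: online system identification and offline optimal control training. First, in the identification process, a neural network (NN) is tuned online using novel tuning laws to learn the complete plant dynamics so that local asymptotic stability of the identification error can be shown. Then, using only the learned NN system model, offline ADP is attempted, resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics, as only the learned NN model is needed. The proof of convergence is demonstrated. Simulation results verify the theoretical conjecture.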
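
For orientation, the setting the abstract describes can be written in the standard notation of the discrete-time ADP literature. The symbols below (f, g, Q, R, J*) are assumptions chosen for illustration and do not appear in the abstract. This is a sketch of the usual formulation, with R assumed symmetric positive definite, and not necessarily the paper's exact equations.

```latex
\begin{align}
  % Affine nonlinear discrete-time system (f, g are the unknown plant dynamics)
  x_{k+1} &= f(x_k) + g(x_k)\,u_k \\
  % Infinite-horizon cost with state penalty Q and control weight R
  J(x_k) &= \sum_{i=k}^{\infty} \left( Q(x_i) + u_i^{\top} R\, u_i \right) \\
  % Discrete-time HJB (Bellman) equation for the optimal cost
  J^{*}(x_k) &= \min_{u_k} \left( Q(x_k) + u_k^{\top} R\, u_k + J^{*}(x_{k+1}) \right) \\
  % Stationarity condition: set the gradient with respect to u_k to zero
  u^{*}(x_k) &= -\tfrac{1}{2}\, R^{-1} g(x_k)^{\top}
                \frac{\partial J^{*}(x_{k+1})}{\partial x_{k+1}}
\end{align}
```

In this form the optimal control depends on g(x_k) and on the one-step-ahead state x_{k+1}. That dependence is the assumption the abstract refers to, and it is where a learned NN model of the plant can stand in for explicit knowledge of the dynamics.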
