Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems

Authors: Wentao Guo, Jennie Si, Feng Liu, Shengwei Mei

DOI: 10.1109/TNNLS.2017.2702566

Abstract: … function has not been well addressed when policy …, we study policy iteration algorithms with control policy approximation errors for approximate optimal control of discrete-time systems. …

References (53)

Haibo He, Zhen Ni, Jian Fu, "A three-network architecture for on-line learning and optimization based on adaptive dynamic programming," Neurocomputing, vol. 78, pp. 3-13 (2012), DOI: 10.1016/J.NEUCOM.2011.05.031

Alok Kanti Deb, Jayadeva, Madan Gopal, Suresh Chandra, "SVM-Based Tree-Type Neural Networks as a Critic in Adaptive Critic Designs for Control," IEEE Transactions on Neural Networks, vol. 18, pp. 1016-1030 (2007), DOI: 10.1109/TNN.2007.899255

Vasilios N. Katsikis, Dimitrios Pappas, Athanassios Petralias, "An improved method for the computation of the Moore-Penrose inverse matrix," Applied Mathematics and Computation, vol. 217, pp. 9828-9834 (2011), DOI: 10.1016/J.AMC.2011.04.080

Casimir Lesiak, Arthur Krener, "The existence and uniqueness of Volterra series for nonlinear systems," Conference on Decision and Control, vol. 16, pp. 271-274 (1977), DOI: 10.1109/CDC.1977.271584

Silvia Ferrari, Robert F. Stengel, "Online Adaptive Critic Flight Control," Journal of Guidance, Control, and Dynamics, vol. 27, pp. 777-786 (2004), DOI: 10.2514/1.12597

D.V. Prokhorov, D.C. Wunsch, "Adaptive critic designs," IEEE Transactions on Neural Networks, vol. 8, pp. 997-1007 (1997), DOI: 10.1109/72.623201

J.N. Tsitsiklis, B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Transactions on Automatic Control, vol. 42, pp. 674-690 (1997), DOI: 10.1109/9.580874

Kiho Kim, Sung Bae Kim, E.J. Powers, R.W. Miksad, F.J. Fischer, "Adaptive second-order Volterra filtering and its application to second-order drift phenomena," IEEE Journal of Oceanic Engineering, vol. 19, pp. 183-192 (1994), DOI: 10.1109/48.286640

G.K. Venayagamoorthy, R.G. Harley, D.C. Wunsch, "Dual heuristic programming excitation neurocontrol for generators in a multimachine power system," IEEE Industry Applications Society Annual Meeting, vol. 39, pp. 382-394 (2001), DOI: 10.1109/TIA.2003.809438

Frank L. Lewis, Draguna Vrabie, Kyriakos G. Vamvoudakis, "Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers," IEEE Control Systems Magazine, vol. 32, pp. 76-105 (2012), DOI: 10.1109/MCS.2012.2214134
