Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems

作者: Wentao Guo , Jennie Si , Feng Liu , Shengwei Mei

DOI: 10.1109/TNNLS.2017.2702566

关键词:

摘要: … function has not been well addressed when policy … , we study policy iteration algorithms with control policy approximation errors for approximate optimal control of discrete-time systems. …

参考文章(53)
Alok Kanti Deb, Jayadeva, Madan Gopal, Suresh Chandra, SVM-Based Tree-Type Neural Networks as a Critic in Adaptive Critic Designs for Control IEEE Transactions on Neural Networks. ,vol. 18, pp. 1016- 1030 ,(2007) , 10.1109/TNN.2007.899255
Vasilios N. Katsikis, Dimitrios Pappas, Athanassios Petralias, An improved method for the computation of the Moore―Penrose inverse matrix Applied Mathematics and Computation. ,vol. 217, pp. 9828- 9834 ,(2011) , 10.1016/J.AMC.2011.04.080
Casimir Lesiak, Arthur Krener, The existence and uniqueness of Volterra series for nonlinear systems conference on decision and control. ,vol. 16, pp. 271- 274 ,(1977) , 10.1109/CDC.1977.271584
Silvia Ferrari, Robert F. Stengel, Online Adaptive Critic Flight Control Journal of Guidance Control and Dynamics. ,vol. 27, pp. 777- 786 ,(2004) , 10.2514/1.12597
D.V. Prokhorov, D.C. Wunsch, Adaptive critic designs IEEE Transactions on Neural Networks. ,vol. 8, pp. 997- 1007 ,(1997) , 10.1109/72.623201
J.N. Tsitsiklis, B. Van Roy, An analysis of temporal-difference learning with function approximation IEEE Transactions on Automatic Control. ,vol. 42, pp. 674- 690 ,(1997) , 10.1109/9.580874
Kiho Kim, Sung Bae Kim, E.J. Powers, R.W. Miksad, F.J. Fischer, Adaptive second-order Volterra filtering and its application to second-order drift phenomena IEEE Journal of Oceanic Engineering. ,vol. 19, pp. 183- 192 ,(1994) , 10.1109/48.286640
G.K. Venayagamoorthy, R.G. Harley, D.C. Wunsch, Dual heuristic programming excitation neurocontrol for generators in a multimachine power system ieee industry applications society annual meeting. ,vol. 39, pp. 382- 394 ,(2001) , 10.1109/TIA.2003.809438
Frank L Lewis, Draguna Vrabie, Kyriakos G Vamvoudakis, Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers IEEE Control Systems Magazine. ,vol. 32, pp. 76- 105 ,(2012) , 10.1109/MCS.2012.2214134