Authors: Yu Jiang, Zhong-Ping Jiang
DOI: 10.1109/TCSII.2012.2213353
Keywords:
Abstract: … of robust adaptive dynamic programming and the policy iteration technique. An iterative control … [31] P. J. Werbos, “A menu of designs for reinforcement learning over time,” in Neural …