作者: W Thomas Miller , Richard S Sutton , Paul J Werbos
DOI:
关键词: Error-driven learning 、 Reinforcement learning 、 Programming language 、 Code (cryptography) 、 Simple (abstract algebra) 、 Action (philosophy) 、 Computer science 、 Dynamic programming
摘要: This chapter contains sections titled: Introduction and Overview, A Simple Two-Component Adaptive Critic Design, HDP and Dynamic Programming, Alternative Ways to Figure 3.2 in …