Dyna, an integrated architecture for learning, planning, and reacting

作者: Richard S. Sutton

DOI: 10.1145/122344.122377

关键词: Robot learningAction (philosophy)Active learning (machine learning)Algorithmic learning theoryArchitectureComputer scienceArtificial intelligenceProbabilistic logicComputational learning theoryStrengths and weaknesses

摘要: … The structure and learning of the action model lie mostly outside the the scope of the Dyna … the scope of the Dyna architecture per se. Because Dyna makes no strong assumptions …

参考文章(18)
Rick L. Riolo, Lookahead planning and latent learning in a classifier system simulation of adaptive behavior. pp. 316- 326 ,(1991)
Long-Ji Lin, Self-improving reactive agents: case studies of reinforcement learning frameworks simulation of adaptive behavior. pp. 297- 305 ,(1991)
Richard Stuart Sutton, Temporal credit assignment in reinforcement learning University of Massachusetts Amherst. ,(1984)
A. G. Barto, R. S. Sutton, C. J.C.H. Watkins, Learning and Sequential Decision Making University of Massachusetts. ,(1989)
Stuart J. Russell, Execution architectures and compilation international joint conference on artificial intelligence. pp. 15- 20 ,(1989)
Leslie Pack Kaelbling, Nils J. Nilsson, Learning in Embedded Systems ,(1993)
Kenneth James Williams Craik, The nature of explanation ,(1943)