Some Experiments with Case-Based Search.

Steven Bradtke , Wendy G Lehnert
AAAI 133 -138

38
1988
Learning to Solve Stochastic Optimal Path Problems Using Real-Time Dynamic Programming

AG Barto , SJ Bradtke
The Proceedings of the Seventh Yale Workshop on Adaptive and Learning Systems 143 -148

1992
Reinforcement Learning Applied to Linear Quadratic Regulation

Steven J. Bradtke
neural information processing systems 5 295 -302

83
1992
Reinforcement Learning Methods for Continuous-Time Markov Decision Problems

Steven J. Bradtke , Michael O. Duff
neural information processing systems 7 393 -400

467
1994
Learning to act using real-time dynamic programming

Andrew G. Barto , Steven J. Bradtke , Satinder P. Singh
Artificial Intelligence 72 ( 1) 81 -138

1,590
1995
Linear Least-Squares Algorithms for Temporal Difference Learning

Steven J. Bradtke , Andrew G. Barto
Machine Learning 22 ( 1-3) 33 -57

914
1996
Adaptive linear quadratic control using policy iteration

Steven J Bradtke , B Erik Ydstie , Andrew G Barto
Proceedings of 1994 American Control Conference-ACC'94 3 3475 -3479

508
1994
Real-time learning and control using asynchronous dynamic programming

Andrew Gehret Barto , Steven J Bradtke , Satinder P Singh
University of Massachusetts at Amherst, Department of Computer and Information Science

237
1991