Reinforcement learning with supervision by a stable controller

Rosenstein , Barto
american control conference 5 4517 -4522

2004
Pattern-recognizing stochastic learning automata

Andrew. G. Barto , P. Anandan
systems man and cybernetics 15 ( 3) 360 -375

484
1985
A Neural Signature of Hierarchical Reinforcement Learning

José J.F. Ribas-Fernandes , Alec Solway , Carlos Diuk , Joseph T. McGuire
Neuron 71 ( 2) 370 -379

138
2011
DISCRETE AND CONTINUOUS MODELS

ANDREW G. BARTO
International Journal of General Systems 4 ( 3) 163 -177

1978
Controversies in neurosciences IV: Motor learning and synaptic plasticity in the cerebellum: Introduction

C Bell , P Cordo , JC HOUK , JT BUCKINGHAM
Behavioral and brain sciences (Print) 19 ( 3) 339 -527

14
1996
Controlling a nonlinear spring-mass system with a cerebellar model

Jay T Buckingham , James C Houk , JG Barto
Proceedings of the Eighth Yale Workshop on Adaptive and Learning Systems 1 -6

8
1994
A neural network simulation method using the fast Fourier transform

ANDREW G Barto
IEEE Trans. on Systems, Man, and Cybernetics 863 -867

8
1976
Functional mechanisms of motor skill acquisition

Ashvin Shah , Andrew G Barto
BMC Neuroscience 8 ( 2) 203

2
2007
Variable risk control via stochastic optimization

Scott R Kuindersma , Roderic A Grupen , Andrew G Barto
The International Journal of Robotics Research 32 ( 7) 806 -825

29
2013
Intrinsically Motivated Hierarchical Skill Learning in Structured Environments

Christopher M Vigorito , Andrew G Barto
IEEE Transactions on Autonomous Mental Development 2 ( 2) 132 -143

92
2010
Genetic Programming for Reward Function Search

Scott Niekum , Andrew G Barto , Lee Spector
IEEE Transactions on Autonomous Mental Development 2 ( 2) 83 -90

41
2010
Gradient following without back-propagation in layered networks

Andrew G Barto , Michael I Jordan ,
Smpte Journal 2

205
1987
Some recent applications of reinforcement learning

Andrew G Barto , Philip S Thomas , Richard S Sutton
Smpte Journal

39
2017
A temporal-difference model of classical conditioning

Richard S Sutton , Andrew G Barto
Smpte Journal 355 -378

264
1987
Variational Bayesian Optimization for Runtime Risk-Sensitive Control

Priyanshu Agarwal , Suren Kumar , Julian Ryde , Jason Corso
MIT Press 201 -208

2013