Where to Add Actions in Human-in-the-Loop Reinforcement Learning

Authors: Emma Brunskill, Zoran Popović, Yun-En Liu, Travis Mandel

Abstract: … human-in-the-loop reinforcement learning, where an automated agent leverages human intuition to learn effectively in vast action … at which to develop a new action, and present a new …

References (23)
Cynthia Breazeal, Andrea L. Thomaz. Reinforcement learning with human teachers: evidence of feedback and guidance with implications for learning performance. National Conference on Artificial Intelligence, pp. 1000-1005 (2006).
J. A. Clouse. An Introspection Approach to Querying a Trainer. University of Massachusetts (1996).
Michael L. Littman, Bethany R. Leffler, Timothy Edmunds. Efficient reinforcement learning with relocatable action models. National Conference on Artificial Intelligence, pp. 572-577 (2007).
Malcolm J. A. Strens. A Bayesian Framework for Reinforcement Learning. International Conference on Machine Learning, pp. 943-950 (2000).
Nathan Korda, David S. Leslie, Anthony Lee, Benedict C. May. Optimistic Bayesian sampling in contextual-bandit problems. Journal of Machine Learning Research, vol. 13, pp. 2069-2106 (2012). 10.5555/2188385.2343711
Alexander L. Strehl, Michael L. Littman. An analysis of model-based Interval Estimation for Markov Decision Processes. Journal of Computer and System Sciences, vol. 74, pp. 1309-1331 (2008). 10.1016/J.JCSS.2007.08.009
Tao Wang, Daniel Lizotte, Michael Bowling, Dale Schuurmans. Bayesian sparse sampling for on-line reward optimization. Proceedings of the 22nd International Conference on Machine Learning (ICML '05), pp. 956-963 (2005). 10.1145/1102351.1102472
Charles Isbell, Shane Griffith, Jonathan Scholz, Andrea L. Thomaz, Kaushik Subramanian. Policy Shaping: Integrating Human Feedback with Reinforcement Learning. Neural Information Processing Systems, vol. 26, pp. 2625-2633 (2013).
David Silver, J. Andrew Bagnell, Anthony Stentz. Active learning from demonstration for robust autonomous navigation. International Conference on Robotics and Automation, pp. 200-207 (2012). 10.1109/ICRA.2012.6224757