Active Reward Learning from Critiques

Yuchen Cui, Scott Niekum
2018 IEEE International Conference on Robotics and Automation (ICRA) 6907-6914

69
2018
Incremental Task Modification via Corrective Demonstrations

Reymundo A. Gutierrez, Vivian Chu, Andrea L. Thomaz, Scott Niekum
International Conference on Robotics and Automation 1126-1133

6
2018
Enhancing robot learning with human social cues

Akanksha Saran, Elaine Schaertl Short, Andrea Thomaz, Scott Niekum
Human-Robot Interaction 745-747

2
2019
Learning from corrective demonstrations

Reymundo A. Gutierrez, Elaine Schaertl Short, Scott Niekum, Andrea L. Thomaz
Human-Robot Interaction 712-714

3
2019
Uncertainty-Aware Data Aggregation for Deep Imitation Learning

Yuchen Cui, David Isele, Scott Niekum, Kikuo Fujimura
2019 International Conference on Robotics and Automation (ICRA) 761-767

8
2019
Learning pouring skills from demonstration and practice

Akihiko Yamaguchi, Christopher G. Atkeson, Scott Niekum, Tsukasa Ogasawara
IEEE-RAS International Conference on Humanoid Robots 908-915

9
2014
Learning and generalization of complex tasks from unstructured demonstrations

Scott Niekum, Sarah Osentoski, George Konidaris, Andrew G. Barto
Intelligent Robots and Systems 5239-5246

130
2012
Viewpoint selection for visual failure detection

Akanksha Saran, Branka Lakic, Srinjoy Majumdar, Juergen Hess
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 5437-5444

4
2017
Classification error correction: A case study in brain-computer interfacing

Hasan A. Poonawala, Mohammed Alshiekh, Scott Niekum, Ufuk Topcu
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 3006-3012

2017
Human Gaze Following for Human-Robot Interaction

Akanksha Saran, Srinjoy Majumdar, Elaine Schaertl Short, Andrea Thomaz
2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 8615-8621

10
2018
Asking for Help Effectively via Modeling of Human Beliefs

Taylor Kessler Faulkner, Scott Niekum, Andrea Thomaz
Human-Robot Interaction 149-150

1
2018
Fairness guarantees under demographic shift

Stephen Giguere, Blossom Metevier, Yuriy Brun, Bruno Castro da Silva
Smpte Journal

10
2022
The perils of trial-and-error reward design: misdesign through overfitting and invalid task specifications

Serena Booth, W Bradley Knox, Julie Shah, Scott Niekum
Proceedings of the AAAI Conference on Artificial Intelligence 37 (5) 5920-5929

2023
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?

Yuchen Cui, Scott Niekum, Abhinav Gupta, Vikash Kumar
Smpte Journal 893-905

11
2022
You only evaluate once: a simple baseline algorithm for offline rl

Wonjoon Goo, Scott Niekum
Smpte Journal 1543-1553

9
2022
Know your boundaries: The necessity of explicit behavioral cloning in offline rl

Wonjoon Goo, Scott Niekum
arXiv preprint arXiv:2206.00695

2022
A ranking game for imitation learning

Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum
arXiv preprint arXiv:2202.03481

2
2022
Models of human preference for learning reward functions

W Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum
arXiv preprint arXiv:2206.02231

5
2022
Behavior Policy Gradient Supplemental Material

Josiah P Hanna, Philip S Thomas, Peter Stone, Scott Niekum
Smpte Journal

Sope: Spectrum of off-policy estimators

Christina Yuan, Yash Chandak, Stephen Giguere, Philip S Thomas
Advances in Neural Information Processing Systems 34 18958-18969

4
2021