W Bradley Knox

Guangliang Li, Hayley Hung, Shimon Whiteson, W. Bradley Knox
Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2014 Joint IEEE International Conferences on 93-100

2014

Domestic Interaction on a Segway Base

W. Bradley Knox, Juhyun Lee, Peter Stone
RoboCup 2008: Robot Soccer World Cup XII 519-531

2009

Social interaction for efficient agent learning from human reward

Guangliang Li, Shimon Whiteson, W. Bradley Knox, Hayley Hung
Autonomous Agents and Multi-Agent Systems 32 (1) 1-25

2018

Using informative behavior to increase engagement while learning from human reward

Guangliang Li, Shimon Whiteson, W. Bradley Knox, Hayley Hung
Autonomous Agents and Multi-Agent Systems 30 (5) 826-848

2016

The perils of trial-and-error reward design: misdesign through overfitting and invalid task specifications

Serena Booth, W Bradley Knox, Julie Shah, Scott Niekum
Proceedings of the AAAI Conference on Artificial Intelligence 37 (5) 5920-5929

2023

Models of human preference for learning reward functions

W Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum
arXiv preprint arXiv:2206.02231

2022

Learning optimal advantage from preferences and mistaking it for reward

W Bradley Knox, Stephane Hatgis-Kessell, Sigurdur Orn Adalgeirsson, Serena Booth
Proceedings of the AAAI Conference on Artificial Intelligence 38 (9) 10066-10073

2024

Contrastive preference learning: Learning from human feedback without RL

Joey Hejna, Rafael Rafailov, Harshit Sikchi, Chelsea Finn
arXiv preprint arXiv:2310.13639

2023

Understanding human teaching modalities in reinforcement learning environments: A preliminary report

W Bradley Knox, Matthew E Taylor, Peter Stone
IJCAI 2011 Workshop on Agents Learning Interactively from Human Teachers (ALIHT)

2011

Training a robot via human feedback: A case study

W Bradley Knox, Peter Stone, Cynthia Breazeal
Social Robotics: 5th International Conference, ICSR 2013, Bristol, UK, October 27-29, 2013, Proceedings 5 460-470

2013

Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance

W Bradley Knox, Peter Stone
Artificial Intelligence 225 24-50

2015

Design Principles for Creating Human-Shapable Agents

W Bradley Knox, Ian R Fasel, Peter Stone
AAAI Spring Symposium: Agents that Learn from Human Teachers 79-86

2009

Teaching agents with human feedback: a demonstration of the TAMER framework

W Bradley Knox, Peter Stone, Cynthia Breazeal
65-66

2013

Learning from feedback on actions past and intended

W Bradley Knox, Cynthia Breazeal, Peter Stone
In Proceedings of 7th ACM/IEEE International Conference on Human-Robot Interaction, Late-Breaking Reports Session (HRI 2012)

2012

Interactively Shaping Agents via Human Feedback

W Bradley Knox, Peter Stone