作者: Scott Niekum , Alessandro Allievi , Peter Stone , W. Bradley Knox , Yuchen Cui
DOI:
关键词: Leverage (statistics) 、 Robot 、 Computer science 、 Artificial neural network 、 Teaching method 、 Task learning 、 Facial expression 、 Human–computer interaction 、 Gesture
摘要: … proxies for a reaction mapping by watching the reactions of the … their inferred reward, which we refer to as the rewardranking … This work focuses on predicting task statistics from human …