作者: Alessandro Allievi , Peter Stone , W. Bradley Knox , Holger Banzhaf , Felix Schmitt
DOI:
关键词:
摘要: This paper considers the problem of reward design for autonomous driving (AD), with insights that are also applicable to cost functions and performance metrics more generally. Herein we develop 8 simple sanity checks identifying flaws in functions. The applied from past work on reinforcement learning (RL) driving, revealing near-universal AD might exist pervasively across other tasks. Lastly, explore promising directions may help future researchers AD.