Authors: Matteo Leonetti, Bei Peng, Peter Stone, Matthew E. Taylor, Sanmit Narvekar
DOI:
Keywords:
Abstract: Reinforcement learning (RL) is a popular paradigm for addressing sequential decision tasks in which the agent has only limited environmental feedback. Despite many advances over …