Authors: Scott Niekum, Raymond J. Mooney, Prasoon Goyal
DOI:
Keywords: Reinforcement learning, Natural language, Domain (software engineering), Computer science, Machine learning, Pixel, Artificial intelligence, Structure (mathematical logic), Robot, Task (project management), Sample (statistics)
Abstract: Reinforcement learning (RL), particularly in sparse-reward settings, often requires prohibitively large numbers of interactions with the environment, thereby limiting its …