Authors: Matthew E. Taylor, Gabriel Victor de la Cruz, Yunshu Du
DOI:
Keywords:
Abstract: Deep Reinforcement Learning (DRL) algorithms are known to be data inefficient. One reason is that a DRL agent learns both the feature and the policy tabula rasa. Integrating …