Learning temporally-consistent representations for data-efficient reinforcement learning

作者: Trevor McInroe , Lukas Schäfer , Stefano V Albrecht

DOI:

关键词:

摘要: Deep reinforcement learning (RL) agents that exist in high-dimensional state spaces, such as those composed of images, have interconnected learning burdens. Agents must learn an …

参考文章(0)