作者: Jan Peters , Hany Abdulsamad , Samuele Tosatto , Joao Carvalho
DOI:
关键词:
摘要: Reinforcement learning (RL) algorithms still suffer from high sample complexity despite outstanding recent successes. The need for intensive interactions with the environment is …