Algorithms or Actions? A Study in Large-Scale Reinforcement Learning.

作者: Anderson Rocha Tavares , Sivasubramanian Anbalagan , Leandro Soriano Marcolino , Luiz Chaimowicz

DOI:

关键词:

摘要: Large state and action spaces are very challenging to reinforcement learning. However, in many domains there is a set of algorithms available, which estimate the best action given a state. Hence, agents can either directly learn a performance-maximizing mapping from states to actions, or from states to algorithms. We investigate several aspects of this dilemma, showing sufficient conditions for learning over algorithms to outperform over actions for a finite number of training iterations. We present synthetic experiments to further study such systems. Finally, we propose a function approximation approach, demonstrating the effectiveness of learning over algorithms in real-time strategy games.

参考文章(1)
ALBERT XIN JIANG, LEANDRO SORIANO MARCOLINO, ARIEL D PROCACCIA, TUOMAS SANDHOLM, NISARG SHAH, MILIND TAMBE, Diverse Randomized Agents Vote to Win NIPS. ,(2014)