Authors: Sébastien Bubeck, Rémi Munos, Gilles Stoltz
DOI: 10.1007/978-3-642-04414-4_7
Keywords:
Abstract: … We consider the framework of stochastic multi-armed bandit problems and study the possibilities and limitations of strategies that perform an online exploration of the arms. The …