Cheap Bandits

Manjesh Kumar Hanawal , Venkatesh Saligrama , Michal Valko , R ' emi Munos
arXiv: Learning

14
2015
Modification of uct with patterns in monte-carlo go

Sylvain Gelly—Yizao Wang—Rémi Munos , Olivier Teytaud
Technical Report RR-6062 32 30 -56

9
2006
Pure exploration for multi-armed bandit problems

R MUNOS , S BUBECK , G STOLTZ
Lecture Notes in Computer Science

1
2009
An anti-diffusive scheme for viability problems

Sophie MARTIN , Remi MUNOS , Hasnaa ZIDANI

Error bounds for approximate value iteration

Rémi Munos
national conference on artificial intelligence 1006 -1011

57
2005
Reinforcement Learning with a Near Optimal Rate of Convergence

Mohammad Gheshlaghi Azar , Rémi Munos , Mohammad Ghavamzadeh , Hilbert Kappen

21
2011
Reinforcement learning with dynamic covering of state-action space: partitioning Q-learning

Rémi Munos , Jocelyn Patinel
simulation of adaptive behavior 354 -363

12
1994
Rates of Convergence for Variable Resolution Schemes in Optimal Control

Rémi Munos , Andrew W. Moore
international conference on machine learning 647 -654

19
2000
L'Ordinateur, champion de go ?

Rémi Munos , Sylvain Gelly
Pour la science 354 ( 354) 28 -35

2007
Algorithmic Learning Theory : 24th International Conference, ALT 2013, Singapore, October 6-9, 2013. Proceedings

Rémi Munos , Frank Stephan , Sanjay Jain , Thomas Zeugmann
Springer Berlin Heidelberg

2013
2
2010
Variance estimates and exploration function in multi-armed bandit

Csaba Szepesvári , Rémi Munos , Jean-Yves Audibert

32
2008
Recent Advances in Reinforcement Learning

Rémi Munos , Daniil Ryabko , Philippe Preux , Sertan Girgin

1
2009
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences

Rémi Munos , Odalric-Ambrym Maillard , Gilles Stoltz
conference on learning theory 18

70
2011
Adaptive play in Texas Hold'em Poker

Rémi Munos , Raphaël Maîtrepierre , Jérémie Mary
european conference on artificial intelligence 458 -462

11
2008
11
1996