Rémi Munos

机构: Google DeepMind

主页: inria.fr

每年引用次数

引用次数

引用: 40,083

H-指数: 88

I10-指数 : 192

出版物: 329

Manjesh Kumar Hanawal , Venkatesh Saligrama , Michal Valko , R ' emi Munos
arXiv: Learning

2015

Modification of uct with patterns in monte-carlo go

Sylvain Gelly—Yizao Wang—Rémi Munos , Olivier Teytaud
Technical Report RR-6062 32 30 -56

2006

Pure exploration for multi-armed bandit problems

R MUNOS , S BUBECK , G STOLTZ
Lecture Notes in Computer Science

2009

An anti-diffusive scheme for viability problems

Sophie MARTIN , Remi MUNOS , Hasnaa ZIDANI

A general convergence method for Reinforcement Learning in the continuous case

R emi Munos

Combining variable resolution discretization criteria for high-accuracy solutions of continuous time and space MDPs

R emi Munos , Andrew Moore

Rates of Convergence for Variable Resolution Schemes in Optimal Control

R emi Munos , Andrew W Moore

Error bounds for approximate value iteration

Rémi Munos
national conference on artificial intelligence 1006 -1011

2005

Reinforcement Learning with a Near Optimal Rate of Convergence

Mohammad Gheshlaghi Azar , Rémi Munos , Mohammad Ghavamzadeh , Hilbert Kappen

2011

Reinforcement learning with dynamic covering of state-action space: partitioning Q-learning

Rémi Munos , Jocelyn Patinel
simulation of adaptive behavior 354 -363

1994

Rates of Convergence for Variable Resolution Schemes in Optimal Control

Rémi Munos , Andrew W. Moore
international conference on machine learning 647 -654

2000

L'Ordinateur, champion de go ?

Rémi Munos , Sylvain Gelly
Pour la science 354 ( 354) 28 -35

2007

Algorithmic Learning Theory : 24th International Conference, ALT 2013, Singapore, October 6-9, 2013. Proceedings

Rémi Munos , Frank Stephan , Sanjay Jain , Thomas Zeugmann
Springer Berlin Heidelberg

2013

Brownian Motions and Scrambled Wavelets for Least-Squares Regression

Rémi Munos , Odalric-Ambrym Maillard
13

2010

Variance estimates and exploration function in multi-armed bandit

Csaba Szepesvári , Rémi Munos , Jean-Yves Audibert

2008

Recent Advances in Reinforcement Learning

Rémi Munos , Daniil Ryabko , Philippe Preux , Sertan Girgin

2009

A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences

Rémi Munos , Odalric-Ambrym Maillard , Gilles Stoltz
conference on learning theory 18

2011

Adaptive play in Texas Hold'em Poker

Rémi Munos , Raphaël Maîtrepierre , Jérémie Mary
european conference on artificial intelligence 458 -462

2008

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning.

Rémi Munos
international conference on machine learning 337 -345

1996

Recent Advances in Reinforcement Learning: 8th European Workshop, EWRL 2008, Villeneuve d'Ascq, France, June 30-July 3, 2008, Revised and Selected Papers

Rémi Munos , Daniil Ryabko , Philippe Preux , Sertan Girgin
Springer-Verlag

2008

Reinforcement learning

RLHF

MCTS

bandit theory

statistical learning

Rémi Munos

引用次数

出版物: 329

我的账户