Introduction to the special issue on learning theory

GraepelThore , HerbrichRalf
Journal of Machine Learning Research

2003
TRAINING A POLICY NEURAL NETWORK AND A VALUE NEURAL NETWORK

Ilya Sutskever , Thore Kurt Hartwig Graepel , Christopher Maddison , David Silver

13
2018
Parallelization of online learning algorithms

Taha Bekir Eren , Oleg Isakov , Weizhu Chen , Jeffrey Scott Dunn

40
2014
Feature vector construction

Gjergji Kasneci , David Hector Stern , Thore Kurt Hartwig Graepel , Ralf Herbrich

50
2012
Training a policy neural network for controlling an agent using best response policy iteration

Thomas William Anthony , Thomas Edward Eccles , Andrea Tacchetti , János Kramár

2
2022
Selecting actions to be performed by a reinforcement learning agent using tree search

Thore Kurt Hartwig Graepel , Shih-Chieh Huang , David Silver , Arthur Clement Guez

23
2020
Presenting content items using topical relevance and trending popularity

David Stern , Ralf Herbrich , Milad Shokouhi , Thore Kurt Hartwig Graepel

142
2011
Database access

Andrew Donald Gordon , Thore Kurt Hartwig Graepel , Nicolas Philippe Marie Rolland , Eric Johannes Borgstrom

24
2016
Relational database management

Sameer Singh , Thore Kurt Hartwig Graepel , Lucas Julien Bordeaux , Andrew Donald Gordon

10
2020
Jointly updating agent control policies using estimated best responses to current control policies

Luke Christopher Marris , Paul Fernand Michel Muller , Marc Lanctot , Thore Kurt Hartwig Graepel

2024
Selecting points in continuous spaces using neural networks

Thomas Edward Eccles , Ian Michael Gemp , János Kramár , Marta Garnelo Abellanas

2022
Neural network architecture for efficient resource allocation

Andrea Tacchetti , Daniel Joseph Strouse , Marta Garnelo Abellanas , Thore Kurt Hartwig Graepel

2022
Knowledge corroboration

Gjergji Kasneci , Jurgen Anne Francois Marie Van Gael , Thore Kraepel , Ralf Herbrich

29
2014
Stereo video for gaming

Thore KH Graepel , Andrew Blake , Ralf Herbrich

197
2012
Bayesian scoring

Thore KH Graepel , Ralf Herbrich

112
2006
Seeding in a skill scoring framework

Ralf Herbrich , Thore KH Graepel

87
2013
Dependency structure from temporal data

Thore KH Graepel , Ralf Herbrich , Shyansundar Rajaram

59
2010
Player ranking with partial information

Thore KH Graepel , Rafl Herbrich

57
2010
Bayesian scoring

Thore KH Graepel , Ralf Herbrich

47
2008
Determining relative skills of players

Thomas Minka , Thore KH Graepel , Ralf Herbrich

29
2013