PIECEWISE CONSTANT REINFORCEMENT LEARNING FOR ROBOTIC APPLICATIONS

Andrea Bonarini , Alessandro Lazaric , Marcello Restelli
international conference on informatics in control, automation and robotics 214 -221

1
2007
Improving Batch Reinforcement Learning Performance through Transfer of Samples

Andrea Bonarini , Alessandro Lazaric , Marcello Restelli
starting ai researchers' symposium 106 -117

2008
Improving Cooperation among Self-Interested Reinforcement Learning Agents

Andrea Bonarini , Alessandro Lazaric , Marcello Restelli , Enrique Munoz de Cote
european conference on machine learning 1 -8

6
2005
On the Usefulness of Opponent Modeling: the Kuhn Poker case study (Short Paper)

Alessandro Lazaric , Mario Quaresimale , Marcello Restelli

2008
MESSI: Maximum Entropy Semi-Supervised Inverse Reinforcement Learning

Mohammad Ghavamzadeh , Alessandro Lazaric , Julien Audiffren , Michal Valko
neural information processing systems

2014
Finite-Sample Analysis of Lasso-TD

Mohammad Ghavamzadeh , Alessandro Lazaric , Matthew Hoffman , R mi Munos
international conference on machine learning 1177 -1184

46
2011
A truthful learning mechanism for multi-slot sponsored search auctions with externalities

Alessandro Lazaric , Nicola Gatti , Francesco Trovò
adaptive agents and multi-agents systems 227 1325 -1326

16
2012
Transfer of task representation in reinforcement learning using policy-based proto-value functions

Alessandro Lazaric , Marcello Restelli , Eliseo Ferrante
adaptive agents and multi-agents systems 3 1329 -1332

17
2008
Bifurcation Analysis of Reinforcement Learning Agents

A. Lazaric , F. Dercole , M. Restelli , E. Munoz de Cote
Springer Verlag

2008
Stochastic Optimization of a Locally Smooth Function under Correlated Bandit Feedback.

Alessandro Lazaric , Emma Brunskill , Mohammad Gheshlaghi Azar

6
2014
A Dantzig Selector Approach to Temporal Difference Learning

Mohammad Ghavamzadeh , Alessandro Lazaric , Bruno Scherrer , Matthieu Geist
international conference on machine learning 347 -354

22
2012
Semi-Supervised Apprenticeship Learning

Mohammad Ghavamzadeh , Alessandro Lazaric , Michal Valko
european workshop on reinforcement learning 24 131 -142

9
2012
A Learning Approach to Dynamic Coalition Formation

A. Bonarini , A Lazaric , M Restelli , DE Cote E Munoz
adaptive and learning agents 1 -7

2007
Bayesian Multi-Task Reinforcement Learning

Mohammad Ghavamzadeh , Alessandro Lazaric
international conference on machine learning 599 -606

65
2010
Classification-based Policy Iteration with a Critic

Mohammad Ghavamzadeh , Victor Gabillon , Alessandro Lazaric , Bruno Scherrer
international conference on machine learning 1049 -1056

20
2011
Finite-Sample Analysis of Bellman Residual Minimization

Mohammad Ghavamzadeh , Alessandro Lazaric , Rémi Munos , Odalric-Ambrym Maillard
asian conference on machine learning 299 -314

20
2010
Risk-Aversion in Multi-armed Bandits

Alessandro Lazaric , Rémi Munos , Amir Sani
neural information processing systems 25 3275 -3283

64
2012
Incremental Spectral Sparsification for Large-Scale Graph-Based Semi-Supervised Learning

Ioannis Koutis , Alessandro Lazaric , Daniele Calandriello , Michal Valko
arXiv: Machine Learning

1
2016
Improved Learning Complexity in Combinatorial Pure Exploration Bandits

Mohammad Ghavamzadeh , Victor Gabillon , Alessandro Lazaric , Peter L. Bartlett
international conference on artificial intelligence and statistics 1004 -1012

23
2016