gym-DSSAT: a crop model turned into a Reinforcement Learning environment

Romain Gautron , Emilio J Padrón , Philippe Preux , Julien Bigot
arXiv preprint arXiv:2207.03270

6
2022
Learning crop management by reinforcement: gym-DSSAT

Romain Gautron , Emilio J Padrón , Philippe Preux , Julien Bigot
Smpte Journal

2023
Farm-gym: A modular reinforcement learning platform for stochastic agronomic games

Odalric-Ambrym Maillard , Timothée Mathieu , Debabrota Basu
Smpte Journal

2023
Reinforcement learning for crop management support: Review, prospects and challenges

Romain Gautron , Odalric-Ambrym Maillard , Philippe Preux , Marc Corbeels
Computers and Electronics in Agriculture 200 107182

4
2022
Csaba Szepesvári University of Alberta

Alborz Geramifard , Alessandro Lazaric , Amir-massoud Farahmand , Andre Damotta Salles

2012
Finite-sample Analysis of Bellman Residual Minimization

Odalric-Ambrym Maillard , Rémi Munos , Alessandro Lazaric , Mohammad Ghavamzadeh
ACML 299-314

46
2010
Streaming kernel regression with provably adaptive mean, variance, and regularization

Audrey Durand , Odalric-Ambrym Maillard , Joelle Pineau
arXiv preprint arXiv:1708.00768

12
2017
Méthodes des moments pour l’inférence de systèmes séquentiels linéaires rationnels

Marc Tommasi , François Denis , Joëlle Pineau , Odalric-Ambrym Maillard
Université Lille 1

2016
Collaborative algorithms for online personalized mean estimation

Mahsa Asadi , Aurélien Bellet , Odalric-Ambrym Maillard , Marc Tommasi
arXiv preprint arXiv:2208.11530

3
2022
Monte-Carlo tree search with uncertainty propagation via optimal transport

Tuan Dam , Pascal Stenger , Lukas Schneider , Joni Pajarinen
arXiv preprint arXiv:2309.10737

1
2023
Exploration in Reward Machines with Low Regret

Hippolyte Bourel , Anders Jonsson , Odalric-Ambrym Maillard , Mohammad Sadegh Talebi
International Conference on Artificial Intelligence and Statistics 4114-4146

4
2023
Sub-sampling for multi-armed bandits

Akram Baransi , Odalric-Ambrym Maillard , Shie Mannor
Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2014, Nancy, France, September 15-19, 2014. Proceedings, Part I 14 115-131

65
2014
Parallelization of the TD(λ) Learning Algorithm

Odalric-Ambrym Maillard , Rémi Coulom , Philippe Preux
The Seventh European Workshop on Reinforcement Learning

5
2005
AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents

Timothée Mathieu , Riccardo Della Vecchia , Alena Shilova , Matheus Centa de Medeiros
arXiv preprint arXiv:2306.10882

2023
Contextual bandits to help patient follow-up

Emilie Kaufmann , Odalric-Ambrym Maillard , Timothée Mathieu , Philippe Preux

Compressed least-squares regression

Odalric Maillard , Rémi Munos
Advances in Neural Information Processing Systems

139
2009
LSTD with Random Projections

Mohammad Ghavamzadeh , Alessandro Lazaric , Odalric-Ambrym Maillard , Rémi Munos
Neural Information Processing Systems 23 721-729

45
2010
Low-rank bandits with latent mixtures

Aditya Gopalan , Odalric-Ambrym Maillard , Mohammadi Zaki
arXiv preprint arXiv:1609.01508

33
2016