Reinforcement Learning (Dagstuhl Seminar 13321)

Marcus Hutter , Peter Auer , Laurent Orseau
Dagstuhl Reports 3 ( 8) 26

2013
Safely interruptible agents

Stuart Armstrong , Laurent Orseau
uncertainty in artificial intelligence 557 -566

62
2016
Reinforcement Learning with a Corrupted Reward Channel

Tom Everitt , Marcus Hutter , Shane Legg , Victoria Krakovna
arXiv: Artificial Intelligence

92
2017
AI Safety Gridworlds

Tom Everitt , Shane Legg , Jan Leike , Pedro A. Ortega
arXiv: Learning

263
2017
Soft-Bayes: Prod for Mixtures of Experts with Log-Loss.

Tor Lattimore , Shane Legg , Laurent Orseau
algorithmic learning theory 372 -399

5
2017
Measuring and avoiding side effects using relative reachability

Shane Legg , Miljan Martic , Victoria Krakovna , Laurent Orseau
arXiv: Learning

14
2018
Agents and Devices: A Relative Definition of Agency

Shane Legg , Laurent Orseau , Simon McGregor McGill
arXiv: Learning

10
2018
Single-Agent Policy Tree Search With Guarantees

Tor Lattimore , Théophane Weber , Laurent Orseau , Levi H. S. Lelis
neural information processing systems 31 3201 -3211

1
2018
An Investigation of Model-Free Planning

Arthur Guez , Greg Wayne , Adam Santoro , Karol Gregor
international conference on machine learning 2464 -2473

73
2019
Penalizing Side Effects using Stepwise Relative Reachability.

Shane Legg , Miljan Martic , Victoria Krakovna , Laurent Orseau
international joint conference on artificial intelligence

44
2018
Learning to Prove from Synthetic Theorems.

Doina Precup , Xavier Glorot , Eser Aygün , Ankit Anand
arXiv: Logic in Computer Science

18
2020
Logarithmic Pruning is All You Need

Marcus Hutter , Laurent Orseau , Omar Rivasplata
arXiv: Learning

49
2020
Avoiding Side Effects By Considering Future Tasks

Shane Legg , Miljan Martic , Victoria Krakovna , Laurent Orseau
arXiv: Learning

26
2020
Policy-Guided Heuristic Search with Guarantees.

Laurent Orseau , Levi H. S. Lelis
arXiv: Artificial Intelligence

2021
Optimality issues of universal greedy agents with static priors

Laurent Orseau
algorithmic learning theory 345 -359

21
2010
On Thompson Sampling and Asymptotic Optimality

Jan Leike , Tor Lattimore , Laurent Orseau , Marcus Hutter
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence 4889 -4893

1
2017
Pitfalls of Learning a Reward Function Online

Stuart Armstrong , Jan Leike , Laurent Orseau , Shane Legg
international joint conference on artificial intelligence 2 1592 -1600

15
2020
Space-Time Embedded Intelligence

Laurent Orseau , Mark Ring
Artificial General Intelligence 209 -218

32
2012
Memory Issues of Intelligent Agents

Laurent Orseau , Mark Ring
Artificial General Intelligence 219 -231

5
2012