SVM Toolbox

SV Albrecht
Darmstadt University of Technology Department of Computer Science Multimodal Interactive Systems

1
2017
Prediction and planning for mobile robots

Subramanian Ramamoorthy , Mihai DOBRE , Roberto ANTOLIN , Stefano ALBRECHT

2021
Autonomous Driving with Interpretable Goal Recognition and Monte Carlo Tree Search

Stefano V Albrecht , Mihai Dobre , Ram Ramamoorthy , Francisco Eiras
Interaction and Decision-Making in Autonomous-Driving: A Virtual Workshop at RSS 2020

2020
Learning task embeddings for teamwork adaptation in multi-agent reinforcement learning

Lukas Schäfer , Filippos Christianos , Amos Storkey , Stefano V Albrecht
arXiv preprint arXiv:2207.02249

1
2022
Deep reinforcement learning for multi-agent interaction

Ibrahim H Ahmed , Cillian Brewitt , Ignacio Carlucho , Filippos Christianos
AI Communications ( Preprint) 1 -12

3
2022
Robust on-policy data collection for data-efficient policy evaluation

Rujie Zhong , Josiah P Hanna , Lukas Schäfer , Stefano V Albrecht
Smpte Journal

2
2021
Temporal disentanglement of representations for improved generalisation in reinforcement learning

Mhairi Dunion , Trevor McInroe , Kevin Sebastian Luck , Josiah P Hanna
arXiv preprint arXiv:2207.05480

2
2022
Learning Complex Teamwork Tasks using a Sub-task Curriculum

Elliot Fosong , Arrasy Rahman , Ignacio Carlucho , Stefano V Albrecht
arXiv preprint arXiv:2302.04944

2023
Decoupled reinforcement learning to stabilise intrinsically-motivated exploration

Lukas Schäfer , Filippos Christianos , Josiah P Hanna , Stefano V Albrecht
arXiv preprint arXiv:2107.08966

7
2021
Revisiting the Gumbel-Softmax in MADDPG

Callum Rhys Tilbury , Filippos Christianos , Stefano V Albrecht
arXiv preprint arXiv:2302.11793

2023
Learning representations for control with hierarchical forward models

Trevor McInroe , Lukas Schäfer , Stefano V Albrecht
arXiv preprint arXiv:2206.11396

1
2022
A Survey of Ad Hoc Teamwork Research

William Macke , Mohan Sridharan , Peter Stone , Stefano V Albrecht
Smpte Journal 13442 275

2022
Learning temporally-consistent representations for data-efficient reinforcement learning

Trevor McInroe , Lukas Schäfer , Stefano V Albrecht
arXiv preprint arXiv:2110.04935

3
2021
Emergent behaviours in multi-agent systems with Evolutionary Game Theory.

Stefano V Albrecht , Michael Woolridge ,
AI Communications 35 ( 4)

2
2022
Local information agent modelling in partially-observable environments

Georgios Papoudakis , Filippos Christianos , Stefano V Albrecht
arXiv preprint arXiv:2006.09447

5
2021
Expressivity of Emergent Language is a Trade-off between Contextual Complexity and Unpredictability

Shangmin Guo , Yi Ren , Kory Mathewson , Simon Kirby
International Conference on Learning Representations

14
2022
ICED: Zero-Shot Transfer in Reinforcement Learning via In-Context Environment Design

Samuel Garcin , James Doran , Shangmin Guo , Christopher G Lucas
arXiv preprint arXiv:2402.03479

2024
Planning to go out-of-distribution in offline-to-online reinforcement learning

Trevor McInroe , Stefano V Albrecht , Amos Storkey
arXiv preprint arXiv:2310.05723

2
2023
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Dongge Han , Trevor McInroe , Adam Jelley , Stefano V Albrecht
arXiv preprint arXiv:2404.14285

2024
Sample Relationship from Learning Dynamics Matters for Generalisation

Shangmin Guo , Yi Ren , Stefano V Albrecht , Kenny Smith
arXiv preprint arXiv:2401.08808

2024