Stefano V. Albrecht

Stefano V Albrecht , Mihai Dobre , Ram Ramamoorthy , Francisco Eiras
Interaction and Decision-Making in Autonomous-Driving: A Virtual Workshop at RSS 2020

2020

Learning task embeddings for teamwork adaptation in multi-agent reinforcement learning

Lukas Schäfer , Filippos Christianos , Amos Storkey , Stefano V Albrecht
arXiv preprint arXiv:2207.02249

2022

Deep reinforcement learning for multi-agent interaction

Ibrahim H Ahmed , Cillian Brewitt , Ignacio Carlucho , Filippos Christianos
AI Communications ( Preprint) 1 -12

2022

Robust on-policy data collection for data-efficient policy evaluation

Rujie Zhong , Josiah P Hanna , Lukas Schäfer , Stefano V Albrecht
Smpte Journal

2021

Temporal disentanglement of representations for improved generalisation in reinforcement learning

Mhairi Dunion , Trevor McInroe , Kevin Sebastian Luck , Josiah P Hanna
arXiv preprint arXiv:2207.05480

2022

Learning Complex Teamwork Tasks using a Sub-task Curriculum

Elliot Fosong , Arrasy Rahman , Ignacio Carlucho , Stefano V Albrecht
arXiv preprint arXiv:2302.04944

2023

Decoupled reinforcement learning to stabilise intrinsically-motivated exploration

Lukas Schäfer , Filippos Christianos , Josiah P Hanna , Stefano V Albrecht
arXiv preprint arXiv:2107.08966

2021

Revisiting the Gumbel-Softmax in MADDPG

Callum Rhys Tilbury , Filippos Christianos , Stefano V Albrecht
arXiv preprint arXiv:2302.11793

2023

Learning representations for control with hierarchical forward models

Trevor McInroe , Lukas Schäfer , Stefano V Albrecht
arXiv preprint arXiv:2206.11396

2022

A Survey of Ad Hoc Teamwork Research

William Macke , Mohan Sridharan , Peter Stone , Stefano V Albrecht
Smpte Journal 13442 275

2022

Learning temporally-consistent representations for data-efficient reinforcement learning

Trevor McInroe , Lukas Schäfer , Stefano V Albrecht
arXiv preprint arXiv:2110.04935

2021

Emergent behaviours in multi-agent systems with Evolutionary Game Theory.

Stefano V Albrecht , Michael Woolridge ,
AI Communications 35 ( 4)

2022

Local information agent modelling in partially-observable environments

Georgios Papoudakis , Filippos Christianos , Stefano V Albrecht
arXiv preprint arXiv:2006.09447

2021

Expressivity of Emergent Language is a Trade-off between Contextual Complexity and Unpredictability

Shangmin Guo , Yi Ren , Kory Mathewson , Simon Kirby
International Conference on Learning Representations

2022

ICED: Zero-Shot Transfer in Reinforcement Learning via In-Context Environment Design

Samuel Garcin , James Doran , Shangmin Guo , Christopher G Lucas
arXiv preprint arXiv:2402.03479

2024

Planning to go out-of-distribution in offline-to-online reinforcement learning

Trevor McInroe , Stefano V Albrecht , Amos Storkey
arXiv preprint arXiv:2310.05723

2023

LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Dongge Han , Trevor McInroe , Adam Jelley , Stefano V Albrecht
arXiv preprint arXiv:2404.14285

2024

Sample Relationship from Learning Dynamics Matters for Generalisation

Shangmin Guo , Yi Ren , Stefano V Albrecht , Kenny Smith
arXiv preprint arXiv:2401.08808

2024

Artificial Intelligence

Autonomous Agents

Multi-Agent Systems

Reinforcement Learning

Autonomous Driving

Stefano V. Albrecht

引用次数

出版物: 106

我的账户