Analyzing the Hidden Activations of Deep Policy Networks: Why Representation Matters

Trevor A McInroe , Michael Spurrier , Jennifer Sieber , Stephen Conneely
arXiv preprint arXiv:2103.06398

1
2021
Deep reinforcement learning for multi-agent interaction

Ibrahim H Ahmed , Cillian Brewitt , Ignacio Carlucho , Filippos Christianos
AI Communications ( Preprint) 1 -12

3
2022
Temporal disentanglement of representations for improved generalisation in reinforcement learning

Mhairi Dunion , Trevor McInroe , Kevin Sebastian Luck , Josiah P Hanna
arXiv preprint arXiv:2207.05480

2
2022
Learning representations for control with hierarchical forward models

Trevor McInroe , Lukas Schäfer , Stefano V Albrecht
arXiv preprint arXiv:2206.11396

1
2022
Learning temporally-consistent representations for data-efficient reinforcement learning

Trevor McInroe , Lukas Schäfer , Stefano V Albrecht
arXiv preprint arXiv:2110.04935

3
2021
Planning to go out-of-distribution in offline-to-online reinforcement learning

Trevor McInroe , Stefano V Albrecht , Amos Storkey
arXiv preprint arXiv:2310.05723

2
2023
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

Dongge Han , Trevor McInroe , Adam Jelley , Stefano V Albrecht
arXiv preprint arXiv:2404.14285

2024
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning

Mhairi Dunion , Trevor McInroe , Kevin Sebastian Luck , Josiah Hanna
Advances in Neural Information Processing Systems 36

3
2024
Efficient Offline Reinforcement Learning: The Critic is Critical

Adam Jelley , Trevor McInroe , Sam Devlin , Amos Storkey
arXiv preprint arXiv:2406.13376

2024
Safe and Efficient Offline Reinforcement Learning: The Critic is Critical

Adam Jelley , Trevor McInroe , Sam Devlin , Amos Storkey
First Reinforcement Learning Safety Workshop