Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning

Matthew Kyle Schlegel , Volodymyr Tkachuk , Adam M White , Martha White
Transactions on Machine Learning Research

1
2022
Offline Reinforcement Learning via Tsallis Regularization

Lingwei Zhu , Matthew Kyle Schlegel , Han Wang , Martha White
Transactions on Machine Learning Research

General Munchausen Reinforcement Learning with Tsallis Kullback-Leibler Divergence

Lingwei Zhu , Zheng Chen , Matthew Kyle Schlegel , Martha White
NeurIPS 2023

2
2023
Adapting Kernel Representations Online Using Submodular Maximization

Jiecao Chen , Martha White , Yangchen Pan , Matthew Schlegel
international conference on machine learning 3037 -3046

1
2017
Discovery of Predictive Representations With a Network of General Value Functions

Adam White , Martha White , Matthew Schlegel , Andrew Patterson

2018
Importance Resampling for Off-policy Policy Evaluation

Martha White , Matthew Schlegel , Wesley Chung , Daniel Graves

2018
Context-dependent upper-confidence bounds for directed exploration

Adam White , Raksha Kumaraswamy , Martha White , Matthew Schlegel
neural information processing systems 31 4779 -4789

2018
Importance Resampling for Off-policy Prediction

Martha White , Matthew Schlegel , Wesley Chung , Daniel Graves
neural information processing systems 32 1799 -1809

12
2019
General Value Function Networks

Adam White , Martha White , Matthew Schlegel , Andrew Patterson
Journal of Artificial Intelligence Research 70 497 -543

2021
Meta-Descent for Online, Continual Prediction

Andrew Jacobsen , Matthew Schlegel , Cameron Linke , Thomas Degris
national conference on artificial intelligence 33 ( 01) 3943 -3950

18
2019
Continual auxiliary task learning

Matthew McLeod , Chunlok Lo , Matthew Schlegel , Andrew Jacobsen
Advances in Neural Information Processing Systems 34 12549 -12562

2
2021
Structural credit assignment in neural networks using reinforcement learning

Dhawal Gupta , Gabor Mihucz , Matthew Schlegel , James Kostas
Advances in Neural Information Processing Systems 34 30257 -30270

2
2021
A Baseline of Discovery for General Value Function Networks under Partial Observability

Matthew Schlegel , Adam White , Martha White
NeurIPS Workshop on Reinforcement Learning under Partial Observability): Montreal, Canada

4
2018
Stable predictive representations with general value functions for continual learning

Matthew Schlegel , Adam White , Martha White
Continual Learning and Deep Networks workshop at the Neural Information Processing System Conference

2
2017
Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence

Lingwei Zhu , Zheng Chen , Matthew Schlegel , Martha White
arXiv preprint arXiv:2301.11476

1
2023
General Munchausen Reinforcement Learning with Tsallis Kullback-Leibler Divergence

Lingwei Zhu , Zheng Chen , Matthew Schlegel , Martha White
Advances in Neural Information Processing Systems 36

2024
Predictions Predicting Predictions

M Schlegel , Martha White
The 5th Multi-disciplinary Conference on Reinforcement Learning and Decision Making

1
2022