Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Gemini Team , Petko Georgiev , Ving Ian Lei , Ryan Burnell
arXiv preprint arXiv:2403.05530

39
2024
Investigating the properties of neural network representations in reinforcement learning

Han Wang , Erfan Miahi , Martha White , Marlos C Machado
Artificial Intelligence 104100 -104100

16
2024
Many-Shot In-Context Learning

Rishabh Agarwal , Avi Singh , Lei M Zhang , Bernd Bohnet
arXiv preprint arXiv:2404.11018

2
2024
From eye-blinks to state construction: Diagnostic benchmarks for online representation learning

Banafsheh Rafiee , Zaheer Abbas , Sina Ghiassian , Raksha Kumaraswamy
Adaptive Behavior 31 3 -19

8
2023
Planning with expectation models

Yi Wan , Zaheer Abbas , Adam White , Martha White
arXiv preprint arXiv:1904.01191

26
2019
Loss of plasticity in continual deep reinforcement learning

Zaheer Abbas , Rosie Zhao , Joseph Modayil , Adam White
Conference on Lifelong Learning Agents 620 -636

36
2023
Towards model-free RL algorithms that scale well with unstructured data

Joseph Modayil , Zaheer Abbas
arXiv preprint arXiv:2311.02215

2023
Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains

Yangchen Pan , Zaheer Abbas , Adam White , Andrew Patterson
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence 4794 -4800

49
2018
General value function networks

Matthew Schlegel , Andrew Jacobsen , Zaheer Abbas , Andrew Patterson
arXiv preprint arXiv:1807.06763

42
2018
Selective Dyna-style Planning Under Limited Model Capacity

Zaheer Abbas , Samuel Sokota , Erin J Talvitie , Martha White
ICML'20

36
2020
Model-based reinforcement learning with non-linear expectation models and stochastic environments

Yi Wan , Zaheer Abbas , Martha White , Richard S Sutton
FAIM Workshop on Prediction and Generative Modeling in Reinforcement Learning 1 -5

6
2018
Incrementally Learning Functions of the Return

Brendan Bennett , Wesley Chung , Zaheer Abbas , Vincent Liu
arXiv preprint arXiv:1907.04651

1
2019
Gemini: a family of highly capable multimodal models

Gemini Team , Rohan Anil , Sebastian Borgeaud , Jean-Baptiste Alayrac
arXiv preprint arXiv:2312.11805

695
2023