In search of robust measures of generalization

Gintare Karolina Dziugaite , Alexandre Drouin , Brady Neal , Daniel M. Roy
neural information processing systems 33 11723 -11733

51
2020
RL: Generic reinforcement learning codebase in TensorFlow

Bryan Li , Alexander Cowen-Rivers , Piotr Kozakowski , David Tao
Journal of Open Source Software 4 ( 42) 1524

2019
Pretraining representations for data-efficient reinforcement learning

Max Schwarzer , Nitarshan Rajkumar , Michael Noukhovitch , Ankesh Anand
Advances in Neural Information Processing Systems 34 12686 -12699

52
2021
Harms from Increasingly Agentic Algorithmic Systems

Alan Chan , Rebecca Salganik , Alva Markelius , Chris Pang
Smpte Journal 651 -666

2023
Myriad: a real-world testbed to bridge trajectory optimization and deep learning

Nikolaus Howe , Simon Dufort-Labbé , Nitarshan Rajkumar , Pierre-Luc Bacon
Advances in Neural Information Processing Systems 35 29801 -29815

2022
Metadata archaeology: Unearthing data subsets by leveraging training dynamics

Shoaib Ahmed Siddiqui , Nitarshan Rajkumar , Tegan Maharaj , David Krueger
arXiv preprint arXiv:2209.10015

2022
Reclaiming the Digital Commons: A Public Data Trust for Training Data

Alan Chan , Herbie Bradley , Nitarshan Rajkumar
arXiv preprint arXiv:2303.09001

1
2023
Evaluating the text-to-sql capabilities of large language models

Nitarshan Rajkumar , Raymond Li , Dzmitry Bahdanau
arXiv preprint arXiv:2204.00498

80
2022
Visibility into AI Agents

Alan Chan , Carson Ezell , Max Kaufmann , Kevin Wei
958 -973

2
2024
A New National Purpose: Innovation Can Power the Future of Britain

Jeegar Kakkad , Benedict Macon-Cooney , Jess Northend , James Phillips
Tony Blair Institute for Global Change

2
2023
Evaluating the text-to-SQL capabilities of large language models. arXiv

N Rajkumar , R Li , D Bahdanau
arXiv preprint arXiv:2204.00498

5
2022