Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning

Richard S. Sutton , Kristopher De Asis , Silviu Pitis , Daniel Graves
arXiv: Learning

20
2019
Training Recurrent Neural Networks Online by Learning Explicit State Variables

Somjit Nath , Vincent Liu , Alan Chan , Xin Li
international conference on learning representations

3
2020
Automatic prediction of tumour malignancy in breast cancer with fractal dimension.

Alan Chan , Jack A. Tuszynski
Royal Society Open Science 3 ( 12) 160558

43
2016
Greedification operators for policy optimization: Investigating forward and reverse kl divergences

Alan Chan , Hugo Silva , Sungsu Lim , Tadashi Kozuno
The Journal of Machine Learning Research 23 ( 1) 11474 -11552

6
2022
Harms from Increasingly Agentic Algorithmic Systems

Alan Chan , Rebecca Salganik , Alva Markelius , Chris Pang
Smpte Journal 651 -666

2023
Reclaiming the Digital Commons: A Public Data Trust for Training Data

Alan Chan , Herbie Bradley , Nitarshan Rajkumar
arXiv preprint arXiv:2303.09001

1
2023
Foundational challenges in assuring alignment and safety of large language models

Usman Anwar , Abulhair Saparov , Javier Rando , Daniel Paleka
arXiv preprint arXiv:2404.09932

9
2024
Black-Box Access is Insufficient for Rigorous AI Audits

Stephen Casper , Carson Ezell , Charlotte Siegmann , Noam Kolt
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency

11
2024
An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI

Ross Gruetzemacher , Alan Chan , Kevin Frazier , Christy Manning
Socially Responsible Language Modelling Research (SoLaR) at NeurIPS 2023

3
2023
Characterizing manipulation from AI systems

Micah Carroll , Alan Chan , Henry Ashton , David Krueger
1 -13

26
2023
Visibility into AI Agents

Alan Chan , Carson Ezell , Max Kaufmann , Kevin Wei
958 -973

2
2024
Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models

Alan Chan , Ben Bucknall , Herbie Bradley , David Krueger
arXiv preprint arXiv:2312.14751

2023
Open-sourcing highly capable foundation models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives

Elizabeth Seger , Noemi Dreksler , Richard Moulange , Emily Dardaman
arXiv preprint arXiv:2311.09227

17
2023
Inverse policy evaluation for value-based sequential decision-making

Alan Chan , Kris De Asis , Richard S Sutton
arXiv preprint arXiv:2008.11329

1
2020
The Limits of Global Inclusion in AI Development

Alan Chan , Chinasa T Okolo , Zachary Terner , Angelina Wang
AAAI 2021 Workshop on Reframing Diversity in AI

24
2021
Towards the scalable evaluation of cooperativeness in language models

Alan Chan , Maxime Riché , Jesse Clifton
arXiv preprint arXiv:2303.13360

3
2023
IDs for AI Systems

Alan Chan , Noam Kolt , Peter Wills , Usman Anwar
arXiv preprint arXiv:2406.12137

2024
Welfare Diplomacy: Benchmarking Language Model Cooperation

Gabriel Mukobi , Hannah Erlebach , Niklas Lauffer , Lewis Hammond
arXiv preprint arXiv:2310.08901

12
2023
Scoring Rules for Performative Binary Prediction

Alan Chan
arXiv preprint arXiv:2207.02847

3
2021