Emphatic Temporal-Difference Learning.

Ashique Rupam Mahmood , Richard S. Sutton , Martha White , Huizhen Yu
arXiv: Learning

23
2015
An emphatic approach to the problem of off-policy temporal-difference learning

Richard S. Sutton , A. Rupam Mahmood , Martha White
Journal of Machine Learning Research 17 (1) 2603 -2631

95
2016
Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains

Adam White , Martha White
neural information processing systems 23 2433 -2441

22
2010
Generalized Optimal Reverse Prediction

Dale Schuurmans , Martha White
international conference on artificial intelligence and statistics 1305 -1313

5
2012
Learning a value analysis tool for agent evaluation

Michael Bowling , Martha White
international joint conference on artificial intelligence 1976 -1981

17
2009
Off-Policy Actor-Critic

Richard S. Sutton , Thomas Degris , Martha White
international conference on machine learning 179 -186

49
2012
Convex Multi-view Subspace Learning

Dale Schuurmans , Yao-liang Yu , Xinhua Zhang , Martha White
neural information processing systems 25 1673 -1681

91
2012
Optimal estimation of multivariate ARMA models

Dale Schuurmans , Junfeng Wen , Michael Bowling , Martha White
national conference on artificial intelligence 3080 -3086

6
2015
Estimating the class prior and posterior from noisy positives and unlabeled data

Predrag Radivojac , Martha White , Shantanu Jain
arXiv: Machine Learning

107
2016
Adapting Kernel Representations Online Using Submodular Maximization

Jiecao Chen , Martha White , Yangchen Pan , Matthew Schlegel
international conference on machine learning 3037 -3046

1
2017
Multi-view Matrix Factorization for Linear Dynamical System Estimation

Csaba Szepesvari , Dale Schuurmans , Mahdi Karami , Martha White
neural information processing systems 30 7092 -7101

3
2017
Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods

Craig Sherstan , Richard S. Sutton , Adam White , Kenny Young
arXiv: Artificial Intelligence

6
2018
High-confidence error estimates for learned value functions.

Martha White , Touqir Sajed , Wesley Chung
uncertainty in artificial intelligence 683 -692

3
2018
Supervised autoencoders: Improving generalization performance with unsupervised regularizers

Lei Le , Martha White , Andrew Patterson
neural information processing systems 31 107 -117

193
2018
Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling.

Weishi Zheng , Martha White , Tanli Zuo , Ruicheng Li
arXiv: Learning

3
2018
Two-Timescale Networks for Nonlinear Value Function Approximation

Martha White , Ajin Joseph , Wesley Chung , Somjit Nath
international conference on learning representations

31
2019
A Greedy Approach to Adapting the Trace Parameter for Temporal Difference Learning

Adam White , Martha White
adaptive agents and multi-agents systems 557 -565

18
2016
Effective sketching methods for value function approximation.

Martha White , Erfan Sadeqi Azer , Yangchen Pan
uncertainty in artificial intelligence

1
2017
Investigating Practical Linear Temporal Difference Learning

Martha White , Adam White
adaptive agents and multi-agents systems 494 -502

8
2016