Adapting Kernel Representations Online Using Submodular Maximization

Jiecao Chen , Martha White , Yangchen Pan , Matthew Schlegel
international conference on machine learning 3037 -3046

1
2017
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning

Alona Fyshe , Martha White , Yangchen Pan , Qingfeng Lan
arXiv: Artificial Intelligence

89
2020
An implicit function learning approach for parametric modal regression

Amir-massoud Farahmand , Martha White , Yangchen Pan , Ehsan Imani
neural information processing systems 33 11442 -11452

1
2020
Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains

Yangchen Pan , Zaheer Abbas , Adam White , Andrew Patterson
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence 4794 -4800

49
2018
Understanding and mitigating the limitations of prioritized experience replay

Yangchen Pan , Jincheng Mei , Amir-massoud Farahmand , Martha White
Smpte Journal 1561 -1571

1
2022
An alternate policy gradient estimator for softmax policies

Shivam Garg , Samuele Tosatto , Yangchen Pan , Martha White
arXiv preprint arXiv:2112.11622

1
2021
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models

Yangchen Pan , Junfeng Wen , Chenjun Xiao , Philip Torr
arXiv preprint arXiv:2404.15518

2024
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

Yudong Luo , Yangchen Pan , Han Wang , Philip Torr
arXiv preprint arXiv:2403.11062

2024
Improving Adversarial Transferability via Model Alignment

Avery Ma , Amir-massoud Farahmand , Yangchen Pan , Philip Torr
arXiv preprint arXiv:2311.18495

2023
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online

Yangchen Pan , Kirby Banman , Martha White
International Conference on Learning Representations

83
2021
The In-Sample Softmax for Offline Reinforcement Learning

Chenjun Xiao , Han Wang , Yangchen Pan , Adam White
arXiv preprint arXiv:2302.14372

24
2023
Incremental truncated LSTD

Clement Gehring , Yangchen Pan , Martha White
International Joint Conference on Artificial Intelligence

15
2015
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement

Samuel Neumann , Sungsu Lim , Ajin Joseph , Yangchen Pan
arXiv e-prints arXiv: 1810.09103 -arXiv: 1810.09103

4
2018
An alternative to variance: Gini deviation for risk-averse policy gradient

Yudong Luo , Guiliang Liu , Pascal Poupart , Yangchen Pan
Advances in Neural Information Processing Systems 36 60922 -60946

1
2023
Effective sketching methods for value function approximation

Yangchen Pan , Erfan Sadeqi Azer , Martha White
arXiv preprint arXiv:1708.01298

13
2017
Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning

Xutong Zhao , Yangchen Pan , Chenjun Xiao , Sarath Chandar
Uncertainty in Artificial Intelligence 2529 -2540

1
2023
Frequency-based Search-control in Dyna

Yangchen Pan , Jincheng Mei , Amir-massoud Farahmand
arXiv preprint arXiv:2002.05822

14
2020
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime

Zhiyao Luo , Mingcheng Zhu , Fenglin Liu , Jiali Li
arXiv preprint arXiv:2405.18610

2024
Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods

Avery Ma , Yangchen Pan , Amir-massoud Farahmand
Transactions on Machine Learning Research (TMLR)

3
2023