Stochastic optimal control with dynamic, time-consistent risk constraints

Yin-Lam Chow , Marco Pavone
american control conference 390 -395

12
2013
A uniform-grid discretization algorithm for stochastic optimal control with risk constraints

Yin-Lam Chow , Marco Pavone
conference on decision and control 2465 -2470

3
2013
Preference Elicitation with Soft Attributes in Interactive Recommendation

FAN YAO , YINLAM CHOW , ALEX HAIG , CHIH-WEI HSU

Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach

Marco Pavone , Aviv Tamar , Yinlam Chow , Shie Mannor
arXiv: Artificial Intelligence

265
2015
Algorithms for CVaR Optimization in MDPs

Mohammad Ghavamzadeh , Yinlam Chow
neural information processing systems 27 3509 -3517

64
2014
Policy gradient for coherent risk measures

Mohammad Ghavamzadeh , Aviv Tamar , Yinlam Chow , Shie Mannor
neural information processing systems 28 1468 -1476

26
2015
Sequential Multiple Hypothesis Testing with Type I Error Control.

Mohammad Ghavamzadeh , Alan Malek , Yinlam Chow , Sumeet Katariya
international conference on artificial intelligence and statistics 1468 -1476

7
2017
Path Consistency Learning in Tsallis Entropy Regularized MDPs

Mohammad Ghavamzadeh , Yinlam Chow , Ofir Nachum
arXiv: Artificial Intelligence

28
2018
Imitation Learning from Visual Data with Multiple Intentions

Aviv Tamar , Yinlam Chow , Michael Kahane , Khashayar Rohanimanesh
international conference on learning representations

5
2018
Risk-Sensitive Generative Adversarial Imitation Learning

Mohammad Ghavamzadeh , Marco Pavone , Yinlam Chow , Jonathan Lacotte
international conference on artificial intelligence and statistics 2154 -2163

26
2018
A Block Coordinate Ascent Algorithm for Mean-Variance Optimization

Mohammad Ghavamzadeh , Yinlam Chow , Yangyang Xu , Bo Liu
neural information processing systems 31 1065 -1075

12
2018
Lyapunov-based Safe Policy Optimization for Continuous Control

Mohammad Ghavamzadeh , Yinlam Chow , Aleksandra Faust , Edgar Duenez-Guzman
arXiv: Learning

164
2019
Lyapunov-based Safe Policy Optimization

Mohammad Ghavamzadeh , Yinlam Chow , Ofir Nachum , Edgar Guzman-Duenez

2018
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections

Bo Dai , Yinlam Chow , Lihong Li , Ofir Nachum
arXiv: Learning

220
2019
More Robust Doubly Robust Off-policy Evaluation

Mohammad Ghavamzadeh , Mehrdad Farajtabar , Yinlam Chow
international conference on machine learning 1446 -1455

191
2018
Safe Policy Improvement by Minimizing Robust Baseline Regret

Mohammad Ghavamzadeh , Marek Petrik , Yinlam Chow
neural information processing systems 29 2298 -2306

35
2016
A Lyapunov-based Approach to Safe Reinforcement Learning

Mohammad Ghavamzadeh , Yinlam Chow , Edgar A. Duéñez-Guzmán , Ofir Nachum
neural information processing systems 31 8092 -8101

379
2018
CAQL: Continuous Action Q-Learning

Craig Boutilier , Ross Anderson , Yinlam Chow , Moonkyung Ryu
arXiv: Learning

32
2019
AlgaeDICE: Policy Gradient from Arbitrary Experience.

Dale Schuurmans , Bo Dai , Yinlam Chow , Lihong Li
arXiv: Learning

137
2019