Benchmarking Deep Reinforcement Learning for Continuous Control

Rein Houthooft , Pieter Abbeel , Yan Duan , John Schulman
arXiv: Learning

1,659
2016
VIME: Variational Information Maximizing Exploration

Rein Houthooft , Pieter Abbeel , Filip De Turck , Yan Duan
arXiv: Learning

702
2016
Variational Lossy Autoencoder

Ilya Sutskever , Pieter Abbeel , Tim Salimans , Diederik P. Kingma
international conference on learning representations

651
2016
One-Shot Imitation Learning

Ilya Sutskever , Pieter Abbeel , Marcin Andrychowicz , Yan Duan
arXiv: Artificial Intelligence

635
2017
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

Sham Kakade , Pieter Abbeel , Igor Mordatch , Yan Duan
arXiv: Learning

138
2018
7
2017
The Importance of Sampling inMeta-Reinforcement Learning

Rein Houthooft , Ilya Sutskever , Pieter Abbeel , Yan Duan
neural information processing systems 31 9280 -9290

31
2018
Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design

Pieter Abbeel , Yan Duan , Jonathan Ho , Aravind Srinivas
international conference on machine learning 2722 -2730

320
2019
Adversarial Attacks on Neural Network Policies

Sandy Huang , Nicolas Papernot , Ian Goodfellow , Yan Duan
arXiv: Learning

706
2017
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

Rein Houthooft , Pieter Abbeel , Filip De Turck , Yan Duan
arXiv: Artificial Intelligence

521
2016
InfoGAN: interpretable representation learning by information maximizing generative adversarial nets

Rein Houthooft , Ilya Sutskever , Pieter Abbeel , Yan Duan
neural information processing systems 29 2180 -2188

4,256
2016
Stochastic Neural Networks for Hierarchical Reinforcement Learning

Pieter Abbeel , Yan Duan , Carlos Florensa
international conference on learning representations

353
2016
Model-Ensemble Trust-Region Policy Optimization

Pieter Abbeel , Aviv Tamar , Yan Duan , Ignasi Clavera
international conference on learning representations

391
2018
Deep Unsupervised Cardinality Estimation

Pieter Abbeel , Joseph M. Hellerstein , Sanjay Krishnan , Yan Duan
arXiv: Databases

141
2019
NeuroCard: One Cardinality Estimator for All Tables

Yan Duan , Ion Stoica , Eric Liang , Zongheng Yang
arXiv: Databases

75
2020
Planning locally optimal, curvature-constrained trajectories in 3D using sequential convex optimization

Yan Duan , Sachin Patil , John Schulman , Ken Goldberg
international conference on robotics and automation 5889 -5895

21
2014
Gaussian belief space planning with discontinuities in sensing domains

Sachin Patil , Yan Duan , John Schulman , Ken Goldberg
international conference on robotics and automation 6483 -6490

37
2014
Motion planning with sequential convex optimization and convex collision checking

Ken Goldberg , Pieter Abbeel , John Schulman , Yan Duan
The International Journal of Robotics Research 33 ( 9) 1251 -1270

650
2014
Evaluating Protein Transfer Learning with TAPE

Roshan Rao , Nicholas Bhattacharya , Neil Thomas , Yan Duan
bioRxiv 676825

459
2019
Attacking machine learning with adversarial examples

Ian Goodfellow , Nicolas Papernot , Sandy Huang , Yan Duan
OpenAI Blog 24 1 -1

78
2017