Yan Duan

机构: Covariant.AI

主页: rockyduan.com

每年引用次数

引用次数

引用: 17,187

H-指数: 22

I10-指数 : 23

出版物: 33

标题

引用次数

年份

Benchmarking Deep Reinforcement Learning for Continuous Control

Rein Houthooft , Pieter Abbeel , Yan Duan , John Schulman
arXiv: Learning

1,659

2016

VIME: Variational Information Maximizing Exploration

Rein Houthooft , Pieter Abbeel , Filip De Turck , Yan Duan
arXiv: Learning

702

2016

Variational Lossy Autoencoder

Ilya Sutskever , Pieter Abbeel , Tim Salimans , Diederik P. Kingma
international conference on learning representations

651

2016

One-Shot Imitation Learning

Ilya Sutskever , Pieter Abbeel , Marcin Andrychowicz , Yan Duan
arXiv: Artificial Intelligence

635

2017

Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

Sham Kakade , Pieter Abbeel , Igor Mordatch , Yan Duan
arXiv: Learning

138

2018

Meta Learning for Control

Yan Duan

2017

The Importance of Sampling inMeta-Reinforcement Learning

Rein Houthooft , Ilya Sutskever , Pieter Abbeel , Yan Duan
neural information processing systems 31 9280 -9290

2018

Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design

Pieter Abbeel , Yan Duan , Jonathan Ho , Aravind Srinivas
international conference on machine learning 2722 -2730

320

2019

Adversarial Attacks on Neural Network Policies

Sandy Huang , Nicolas Papernot , Ian Goodfellow , Yan Duan
arXiv: Learning

706

2017

#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

Rein Houthooft , Pieter Abbeel , Filip De Turck , Yan Duan
arXiv: Artificial Intelligence

521

2016

InfoGAN: interpretable representation learning by information maximizing generative adversarial nets

Rein Houthooft , Ilya Sutskever , Pieter Abbeel , Yan Duan
neural information processing systems 29 2180 -2188

4,256

2016

Stochastic Neural Networks for Hierarchical Reinforcement Learning

Pieter Abbeel , Yan Duan , Carlos Florensa
international conference on learning representations

353

2016

Model-Ensemble Trust-Region Policy Optimization

Pieter Abbeel , Aviv Tamar , Yan Duan , Ignasi Clavera
international conference on learning representations

391

2018

Deep Unsupervised Cardinality Estimation

Pieter Abbeel , Joseph M. Hellerstein , Sanjay Krishnan , Yan Duan
arXiv: Databases

141

2019

NeuroCard: One Cardinality Estimator for All Tables

Yan Duan , Ion Stoica , Eric Liang , Zongheng Yang
arXiv: Databases

2020

Planning locally optimal, curvature-constrained trajectories in 3D using sequential convex optimization

Yan Duan , Sachin Patil , John Schulman , Ken Goldberg
international conference on robotics and automation 5889 -5895

2014

Gaussian belief space planning with discontinuities in sensing domains

Sachin Patil , Yan Duan , John Schulman , Ken Goldberg
international conference on robotics and automation 6483 -6490

2014

Motion planning with sequential convex optimization and convex collision checking

Ken Goldberg , Pieter Abbeel , John Schulman , Yan Duan
The International Journal of Robotics Research 33 ( 9) 1251 -1270

650

2014

Evaluating Protein Transfer Learning with TAPE

Roshan Rao , Nicholas Bhattacharya , Neil Thomas , Yan Duan
bioRxiv 676825

459

2019

Attacking machine learning with adversarial examples

Ian Goodfellow , Nicolas Papernot , Sandy Huang , Yan Duan
OpenAI Blog 24 1 -1

2017

Robotics

Machine Learning

Reinforcement Learning

Meta Learning

Yan Duan

引用次数

出版物: 33

我的账户