Socially Aware Motion Planning with Deep Reinforcement Learning

Authors: Jonathan P. How, Yu Fan Chen, Michael Everett, Miao Liu

Abstract: For robotic vehicles to navigate safely and efficiently in pedestrian-rich environments, it is important to model subtle human behaviors and navigation rules (e.g., passing on the right). However, while instinctive to humans, socially compliant navigation is still difficult to quantify due to the stochasticity in people's behaviors. Existing works are mostly focused on using feature-matching techniques to describe and imitate human paths, but often do not generalize well since the feature values can vary from person to person, and even from run to run. This work notes that while it is challenging to directly specify the details of what to do (precise mechanisms of human navigation), it is straightforward to specify what not to do (violations of social norms). Specifically, using deep reinforcement learning, this work develops a time-efficient navigation policy that respects common social norms. The proposed method is shown to enable fully autonomous navigation of a robotic vehicle moving at human walking speed in an environment with many pedestrians.
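
The idea of specifying "what not to do" can be illustrated with a small reward term that penalizes a norm violation. The following is only a minimal sketch and not the reward used in the paper: the function name `norm_violation_penalty`, the 3 m interaction radius, the penalty value, and the geometric test for "oncoming" are all hypothetical choices made for illustration. It assumes a pass-on-the-right convention, under which an oncoming pedestrian should end up on the robot's left.

```python
import math


def norm_violation_penalty(robot_xy, robot_heading, other_xy, other_heading,
                           radius=3.0, penalty=-0.05):
    """Return a negative reward if an oncoming agent is on the robot's right.

    All thresholds are illustrative assumptions, not values from the paper.
    Headings are in radians; positions are (x, y) tuples in meters.
    """
    dx = other_xy[0] - robot_xy[0]
    dy = other_xy[1] - robot_xy[1]
    # Ignore agents outside the (assumed) interaction radius.
    if math.hypot(dx, dy) > radius:
        return 0.0

    # Express the offset in the robot frame: forward axis and left axis.
    cos_h, sin_h = math.cos(robot_heading), math.sin(robot_heading)
    forward = cos_h * dx + sin_h * dy
    left = -sin_h * dx + cos_h * dy

    # Wrap the heading difference to [-pi, pi]; "oncoming" means roughly opposite.
    diff = other_heading - robot_heading
    heading_diff = math.atan2(math.sin(diff), math.cos(diff))
    oncoming = abs(heading_diff) > 3.0 * math.pi / 4.0

    # Passing on the right keeps the other agent on the left (left > 0),
    # so an oncoming agent ahead and to the right counts as a violation.
    if oncoming and forward > 0.0 and left < 0.0:
        return penalty
    return 0.0
```

In a reinforcement learning setup, a term like this would be added to the usual goal-reaching and collision rewards, so that the policy is discouraged from norm-violating configurations without being told which exact path to follow.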

