Deep, Consistent Behavioral Decision Making with Planning Features for Autonomous Vehicles

作者: Lilin Qian , Xin Xu , Yujun Zeng , Junwen Huang

DOI: 10.3390/ELECTRONICS8121492

关键词:

摘要: Autonomous driving promises to be the main trend in future intelligent transportation systems due its potentiality for energy saving, and traffic safety improvements. However, traditional autonomous vehicles’ behavioral decisions face consistency issues between decision trajectory planning shows a strong dependence on human experience. In this paper, we present planning-feature-based deep behavior method (PFBD) complex, dynamic traffic. We used reinforcement learning (DRL) framework with twin delayed deterministic policy gradient algorithm (TD3) exploit optimal policy. took into account features of topological routes making vehicles, through which path layers can guaranteed. Specifically, route extracted from space are shared as input states decision. The actor-network learns near-optimal feasible safe candidate emulated routes. Simulation tests three typical scenarios have been performed demonstrate performance policy, including comparison rule-based expert considering partial information contour. results show that proposed approach achieve better decisions. Real-time test an HQ3 (HongQi third ) vehicle also validated effectiveness PFBD.

参考文章(21)
Arda Kurt, Ümit Özgüner, Hierarchical finite state machines for autonomous mobile systems Control Engineering Practice. ,vol. 21, pp. 184- 194 ,(2013) , 10.1016/J.CONENGPRAC.2012.09.020
Dean A. Pomerleau, ALVINN: An Autonomous Land Vehicle in a Neural Network neural information processing systems. ,vol. 1, pp. 305- 313 ,(1988) , 10.1184/R1/6603146.V1
Chuanqiang Lian, Xin Xu, Hong Chen, Haibo He, Near-Optimal Tracking Control of Mobile Robots Via Receding-Horizon Dual Heuristic Programming IEEE Transactions on Systems, Man, and Cybernetics. ,vol. 46, pp. 2484- 2496 ,(2016) , 10.1109/TCYB.2015.2478857
David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, Demis Hassabis, None, Mastering the game of Go with deep neural networks and tree search Nature. ,vol. 529, pp. 484- 489 ,(2016) , 10.1038/NATURE16961
Beat Flepp, Lawrence D. Jackel, Davide Del Testa, Karol Zieba, Urs Muller, Mathew Monfort, Mariusz Bojarski, Prasoon Goyal, Xin Zhang, Daniel Dworakowski, Bernhard Firner, Jiakai Zhang, Jake Zhao, End to End Learning for Self-Driving Cars arXiv: Computer Vision and Pattern Recognition. ,(2016)
Zhenhua Huang, Chuanqiang Lian, Xin Xu, Jian Wang, LATERAL CONTROL FOR AUTONOMOUS LAND VEHICLES VIA DUAL HEURISTIC PROGRAMMING international conference on robotics and automation. ,vol. 31, pp. 547- ,(2016) , 10.2316/JOURNAL.206.2016.6.206-4878
Zhenhua Huang, Xin Xu, Haibo He, Jun Tan, Zhenping Sun, Parameterized Batch Reinforcement Learning for Longitudinal Control of Autonomous Land Vehicles IEEE Transactions on Systems, Man, and Cybernetics. ,vol. 49, pp. 730- 741 ,(2019) , 10.1109/TSMC.2017.2712561
David Meger, Herke van Hoof, Scott Fujimoto, Addressing Function Approximation Error in Actor-Critic Methods arXiv: Artificial Intelligence. ,(2018)
Long Chen, Xuemin Hu, Wei Tian, Hong Wang, Dongpu Cao, Fei-Yue Wang, Parallel planning: a new motion planning framework for autonomous driving IEEE/CAA Journal of Automatica Sinica. ,vol. 6, pp. 236- 246 ,(2019) , 10.1109/JAS.2018.7511186
Florian Kuhnt, Peter Wolf, J. Marius Zöllner, Karl Kurzer, Tobias Wingert, Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States arXiv: Learning. ,(2018) , 10.1109/IVS.2018.8500427