Interpretable UAV Collision Avoidance using Deep Reinforcement Learning.

作者: Deepak-George Thomas , Daniil Olshanskyi , Karter Krueger , Tichakorn Wongpiromsarn , Ali Jannesari

DOI:

关键词:

摘要: The significant components of any successful autonomous flight system are task completion and collision avoidance. Most deep learning algorithms successfully execute these aspects …

参考文章(37)
B. Moses Sathyaraj, L. C. Jain, A. Finn, S. Drake, Multiple UAVs path planning algorithms: a comparative study Fuzzy Optimization and Decision Making. ,vol. 7, pp. 257- 267 ,(2008) , 10.1007/S10700-008-9035-0
Hado V. Hasselt, Double Q-learning neural information processing systems. ,vol. 23, pp. 2613- 2621 ,(2010)
Matthew Hausknecht, Peter Stone, Deep Recurrent Q-Learning for Partially Observable MDPs arXiv: Learning. ,(2015)
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis, None, Human-level control through deep reinforcement learning Nature. ,vol. 518, pp. 529- 533 ,(2015) , 10.1038/NATURE14236
Tom Schaul, Marc Lanctot, Nando De Freitas, Ziyu Wang, Matteo Hessel, Hado Van Hasselt, Dueling network architectures for deep reinforcement learning international conference on machine learning. pp. 1995- 2003 ,(2016)
C. J. C. H. Watkins, Technical Note:Q-Learning Reinforcement Learning. pp. 55- 68 ,(1993)
Yoshua Bengio, Jérôme Louradour, Ronan Collobert, Jason Weston, Curriculum learning Proceedings of the 26th Annual International Conference on Machine Learning - ICML '09. pp. 41- 48 ,(2009) , 10.1145/1553374.1553380
Sašo Džeroski, Luc De Raedt, Hendrik Blockeel, Relational Reinforcement Learning inductive logic programming. ,vol. 2086, pp. 271- 280 ,(1998) , 10.1007/BFB0027307
Wenjie Pei, Tadas Baltrusaitis, David M. J. Tax, Louis-Philippe Morency, Temporal Attention-Gated Model for Robust Sequence Classification 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 820- 829 ,(2017) , 10.1109/CVPR.2017.94
James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, Demis Hassabis, Claudia Clopath, Dharshan Kumaran, Raia Hadsell, None, Overcoming catastrophic forgetting in neural networks Proceedings of the National Academy of Sciences of the United States of America. ,vol. 114, pp. 3521- 3526 ,(2017) , 10.1073/PNAS.1611835114