A Collision-Free Path Planning Method Using Direct Behavior Cloning

作者: Zijing Chi , Lei Zhu , Fan Zhou , Chungang Zhuang

DOI: 10.1007/978-3-030-27538-9_45

关键词: State of the EnvironmentObstacle avoidanceObstacleRobotic armImitation learningDeep learningMotion planningArtificial intelligenceComputer sciencePython (programming language)

摘要: An effective path planning approach based on deep learning for robotic arms is presented in this paper. Direct behavior cloning applied to extract the obstacle avoidance policy collision-free paths generated by reliable motion planners, such as RRT* algorithm our case. Behavior simplest form of imitation learning, also known Learning from Demonstration (LfD), where an agent tries learn a recover expert’s action with respect state environment. The designed paper gives scene knowing pose obstacle, initial and goal configurations. taken each time changes thus method able achieve online regardless whether environment static or dynamic. We build simulation V-REP Python client program collect state-action dataset validate trained policies. Policy models without visual input are constructed tested same experiment setting determine best solution. Results show that model accurate handles issue well.

参考文章(14)
Ladislau Bölöni, Aman Behal, Rouhollah Rahmatizadeh, Pooya Abolghasemi, From virtual demonstration to real-world manipulation using LSTM and MDN arXiv: Robotics. ,(2016)
Jan Kautz, Iuri Frosio, Stephen Tyree, Mengyuan Yan, Sim-to-Real Transfer of Accurate Grasping with Eye-In-Hand Observations and Continuous Control. arXiv: Robotics. ,(2017)
Mrinal Kalakrishnan, Vincent Vanhoucke, Sergey Levine, Peter Pastor, Alexander Herzog, Ethan Holly, Deirdre Quillen, Julian Ibarz, Eric Jang, Alex Irpan, Dmitry Kalashnikov, QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation arXiv: Learning. ,(2018)
Zaid Tahir, Ahmed H. Qureshi, Yasar Ayaz, Raheel Nawaz, Potentially guided bidirectionalized RRT* for fast optimal path planning in cluttered environments Robotics and Autonomous Systems. ,vol. 108, pp. 13- 27 ,(2018) , 10.1016/J.ROBOT.2018.06.013
Fereshteh Sadeghi, Alexander Toshev, Eric Jang, Sergey Levine, Sim2Real Viewpoint Invariant Visual Servoing by Recurrent Control 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 4691- 4699 ,(2018) , 10.1109/CVPR.2018.00493
Tu-Hoa Pham, Giovanni De Magistris, Ryuki Tachibana, OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World 2018 IEEE International Conference on Robotics and Automation (ICRA). pp. 6236- 6243 ,(2018) , 10.1109/ICRA.2018.8460547
Brian Ichter, James Harrison, Marco Pavone, Learning Sampling Distributions for Robot Motion Planning 2018 IEEE International Conference on Robotics and Automation (ICRA). pp. 7087- 7094 ,(2018) , 10.1109/ICRA.2018.8460730
Yuke Zhu, Ziyu Wang, Josh Merel, Andrei Rusu, Tom Erez, Serkan Cabi, Saran Tunyasuvunakool, János Kramár, Raia Hadsell, Nando de Freitas, Nicolas Heess, Reinforcement and Imitation Learning for Diverse Visuomotor Skills robotics science and systems. ,vol. 14, ,(2018) , 10.15607/RSS.2018.XIV.009
Jost Tobias Springenberg, Volodymyr Mnih, Michael Neunert, Thomas Lampe, Martin A. Riedmiller, Roland Hafner, Jonas Degrave, Nicolas Heess, Tom Van de Wiele, Learning by Playing - Solving Sparse Reward Tasks from Scratch international conference on machine learning. pp. 4341- 4350 ,(2018)
Rouhollah Rahmatizadeh, Pooya Abolghasemi, Ladislau Boloni, Sergey Levine, Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-to-End Learning from Demonstration international conference on robotics and automation. pp. 3758- 3765 ,(2018) , 10.1109/ICRA.2018.8461076