Accelerating Grasp Exploration by Leveraging Learned Priors

作者: Ken Goldberg , Michael Danielczuk , Vishal Satish , Ashwin Balakrishna , Han Yu Li

DOI:

关键词: Artificial intelligenceObject (computer science)Machine learningGRASPRobotComputer scienceSet (psychology)

摘要: The ability of robots to grasp novel objects has industry applications in e-commerce order fulfillment and home service. Data-driven grasping policies have achieved success learning general strategies for arbitrary objects. However, these approaches can fail which complex geometry or are significantly outside the training distribution. We present a Thompson sampling algorithm that learns given object with unknown using online experience. leverages learned priors from Dexterity Network robot planner guide exploration provide probabilistic estimates each stable pose object. find seeding policy Dex-Net prior allows it more efficiently robust grasps on Experiments suggest best attains an average total reward 64.5% higher than greedy baseline achieves within 5.7% oracle when evaluated over 300,000 runs across set 3000 poses.

参考文章(26)
Daniel Kappler, Jeannette Bohg, Stefan Schaal, Leveraging big data for grasp planning international conference on robotics and automation. pp. 4304- 4311 ,(2015) , 10.1109/ICRA.2015.7139793
S. Shankar Sastry, Richard M. Murray, Li Zexiang, A Mathematical Introduction to Robotic Manipulation ,(1994)
Michael Laskey, Jeff Mahler, Zoe McCarthy, Florian T. Pokorny, Sachin Patil, Jur van den Berg, Danica Kragic, Pieter Abbeel, Ken Goldberg, Multi-armed bandit models for 2D grasp planning with uncertainty conference on automation science and engineering. pp. 572- 579 ,(2015) , 10.1109/COASE.2015.7294140
A. Bicchi, V. Kumar, Robotic grasping and contact: a review international conference on robotics and automation. ,vol. 1, pp. 348- 353 ,(2000) , 10.1109/ROBOT.2000.844081
M. G. KENDALL, THE TREATMENT OF TIES IN RANKING PROBLEMS Biometrika. ,vol. 33, pp. 239- 251 ,(1945) , 10.1093/BIOMET/33.3.239
M. G. KENDALL, A NEW MEASURE OF RANK CORRELATION Biometrika. ,vol. 30, pp. 81- 93 ,(1938) , 10.1093/BIOMET/30.1-2.81
Ian Lenz, Honglak Lee, Ashutosh Saxena, Deep learning for detecting robotic grasps The International Journal of Robotics Research. ,vol. 34, pp. 705- 724 ,(2015) , 10.1177/0278364914549607
Jeannette Bohg, Antonio Morales, Tamim Asfour, Danica Kragic, Data-Driven Grasp Synthesis—A Survey IEEE Transactions on Robotics. ,vol. 30, pp. 289- 309 ,(2014) , 10.1109/TRO.2013.2289018
Ashutosh Saxena, Justin Driemeyer, Andrew Y. Ng, Robotic Grasping of Novel Objects using Vision The International Journal of Robotics Research. ,vol. 27, pp. 157- 173 ,(2008) , 10.1177/0278364907087172
Olivier Chapelle, Lihong Li, An Empirical Evaluation of Thompson Sampling neural information processing systems. ,vol. 24, pp. 2249- 2257 ,(2011)