BigBIRD: A large-scale 3D database of object instances

作者: Arjun Singh , James Sha , Karthik S. Narayan , Tudor Achim , Pieter Abbeel

DOI: 10.1109/ICRA.2014.6906903

关键词: RobotCode (cryptography)Component-based software engineeringPoint cloudComputer visionState (computer science)Artificial intelligenceInformation retrieval3D single-object recognitionComputer scienceObject (computer science)

摘要: The state of the art in computer vision has rapidly advanced over past decade largely aided by shared image datasets. However, most these datasets tend to consist assorted collections images from web that do not include 3D information or pose information. Furthermore, they target problem object category recognition—whereas solving instance recognition might be sufficient for many robotic tasks. To address issues, we present a highquality, large-scale dataset instances, with accurate calibration every image. We anticipate “solving” this will effectively remove perceptionrelated problems mobile, sensing-based robots. contributions work of: (1) BigBIRD, 100 objects (and growing), composed of, each object, 600 point clouds and high-resolution (12 MP) spanning all views, (2) method jointly calibrating multi-camera system, (3) details our data collection which collects required single under 6 minutes minimal human effort, (4) multiple software components (made available open source), used automate multi-sensor process. All code are at http://rll.eecs.berkeley.edu/bigbird.

参考文章(32)
Paolo Cignoni, Massimiliano Corsini, Guido Ranzuglia, MeshLab: an Open-Source 3D Mesh Processing System. Ercim News. ,vol. 2008, ,(2008)
Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus, Indoor Segmentation and Support Inference from RGBD Images Computer Vision – ECCV 2012. pp. 746- 760 ,(2012) , 10.1007/978-3-642-33715-4_54
Daniel Herrera C., Juho Kannala, Janne Heikkilä, Accurate and practical calibration of a depth and color camera pair computer analysis of images and patterns. pp. 437- 445 ,(2011) , 10.1007/978-3-642-23678-5_52
Michael Kaess, Maurice Fallon, John McDonald, Thomas Whelan, John J. Leonard, Hordur Johannsson, Kintinuous: Spatially Extended KinectFusion national conference on artificial intelligence. ,(2012)
Marwan Mattar, Tamara Berg, Gary B. Huang, Eric Learned-Miller, Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments Workshop on Faces in 'Real-Life' Images: Detection, Alignment, and Recognition. ,(2008)
Jan Smisek, Michal Jancosek, Tomas Pajdla, 3D with Kinect international conference on computer vision. pp. 1154- 1160 ,(2011) , 10.1007/978-1-4471-4640-7_1
Richard A. Newcombe, Andrew Fitzgibbon, Shahram Izadi, Otmar Hilliges, David Molyneaux, David Kim, Andrew J. Davison, Pushmeet Kohi, Jamie Shotton, Steve Hodges, KinectFusion: Real-time dense surface mapping and tracking international symposium on mixed and augmented reality. pp. 127- 136 ,(2011) , 10.1109/ISMAR.2011.6092378
Andreas Geiger, Frank Moosmann, Omer Car, Bernhard Schuster, Automatic camera and range sensor calibration using a single shot international conference on robotics and automation. pp. 3936- 3943 ,(2012) , 10.1109/ICRA.2012.6224570
Michael Kazhdan, Matthew Bolitho, Hugues Hoppe, Poisson surface reconstruction symposium on geometry processing. pp. 61- 70 ,(2006) , 10.5555/1281957.1281965
Yasutaka Furukawa, Brian Curless, Steven M. Seitz, Richard Szeliski, Towards Internet-scale multi-view stereo computer vision and pattern recognition. pp. 1434- 1441 ,(2010) , 10.1109/CVPR.2010.5539802