Deep Learning for Single-View Instance Recognition

作者: Silvio Savarese , Sebastian Thrun , David Held

DOI:

关键词:

摘要: Deep learning methods have typically been trained on large datasets in which many training examples are available. However, real-world product only a small number of images available for each product. We explore the use deep recognizing object instances when we single example per class. show that feedforward neural networks outperform state-of-the-art objects from novel viewpoints even just image object. To further improve our performance this task, propose to take advantage supplementary dataset observe separate set multiple viewpoints. introduce new approach instance recognition with limited data, an auxiliary multi-view train network be robust viewpoint changes. find leads more classifier viewpoints, outperforming previous approaches including keypoint-matching, template-based techniques, and sparse coding.

参考文章(50)
Philipp Krähenbühl, Vladlen Koltun, Geodesic Object Proposals Computer Vision – ECCV 2014. pp. 725- 739 ,(2014) , 10.1007/978-3-319-10602-1_47
Thomas Leung, Jitendra Malik, Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons International Journal of Computer Vision. ,vol. 43, pp. 29- 44 ,(2001) , 10.1023/A:1011126920638
Liefeng Bo, Xiaofeng Ren, Dieter Fox, Unsupervised Feature Learning for RGB-D Based Object Recognition Experimental Robotics. pp. 387- 402 ,(2013) , 10.1007/978-3-319-00065-7_27
Vittorio Ferrari, Tinne Tuytelaars, Luc Van Gool, Simultaneous Object Recognition and Segmentation by Image Exploration european conference on computer vision. pp. 40- 54 ,(2004) , 10.1007/978-3-540-24670-1_4
Herbert Bay, Tinne Tuytelaars, Luc Van Gool, SURF: speeded up robust features european conference on computer vision. ,vol. 1, pp. 404- 417 ,(2006) , 10.1007/11744023_32
Matthew D. Zeiler, Rob Fergus, Visualizing and Understanding Convolutional Networks european conference on computer vision. pp. 818- 833 ,(2014) , 10.1007/978-3-319-10590-1_53
Edward Hsiao, Martial Hebert, Occlusion Reasoning for Object Detectionunder Arbitrary Viewpoint IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 36, pp. 1803- 1815 ,(2014) , 10.1109/TPAMI.2014.2303085
Arjun Singh, James Sha, Karthik S. Narayan, Tudor Achim, Pieter Abbeel, BigBIRD: A large-scale 3D database of object instances international conference on robotics and automation. pp. 509- 516 ,(2014) , 10.1109/ICRA.2014.6906903
Tatiana Tommasi, Barbara Caputo, The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories british machine vision conference. pp. 1- 11 ,(2009) , 10.5244/C.23.80
Judy Hoffman, Erik Rodner, Jeff Donahue, Brian Kulis, Kate Saenko, Asymmetric and Category Invariant Feature Transformations for Domain Adaptation International Journal of Computer Vision. ,vol. 109, pp. 28- 41 ,(2014) , 10.1007/S11263-014-0719-3