3DNN: 3D Nearest Neighbor

作者: Scott Satkin , Maheen Rashid , Jason Lin , Martial Hebert

DOI: 10.1007/S11263-014-0734-4

关键词: Rendering (computer graphics)Computer visionAffordanceLeverage (statistics)Computer scienceSimilarity measureObject detectionArtificial intelligenceMachine learningViewpointsk-nearest neighbors algorithmSegmentation

摘要: In this paper, we describe a data-driven approach to leverage repositories of 3D models for scene understanding. Our ability relate what see in an image large collection allows us transfer information from these models, creating rich understanding the scene. We develop framework auto-calibrating camera, rendering viewpoint was taken, and computing similarity measure between each model input image. demonstrate context geometry estimation show find identities, poses styles objects The true benefit 3DNN compared traditional 2D nearest-neighbor is that by generalizing across viewpoints, free ourselves need have training examples captured all possible viewpoints. Thus, are able achieve comparable results using orders magnitude less data, recognize never-before-seen work, algorithm rigorously evaluate its performance tasks object detection/segmentation, as well two novel applications: affordance photorealistic insertion.

参考文章(69)
K. Lai, D. Fox, 3D laser scan classification using web data and domain adaptation robotics science and systems. ,vol. 05, ,(2009) , 10.15607/RSS.2009.V.022
Georges Baatz, Olivier Saurer, Kevin Köser, Marc Pollefeys, Large Scale Visual Geo-Localization of Images in Mountainous Terrain Computer Vision – ECCV 2012. pp. 517- 530 ,(2012) , 10.1007/978-3-642-33709-3_37
Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus, Indoor Segmentation and Support Inference from RGBD Images Computer Vision – ECCV 2012. pp. 746- 760 ,(2012) , 10.1007/978-3-642-33715-4_54
Ce Liu, Jenny Yuen, Antonio Torralba, Josef Sivic, William T. Freeman, SIFT Flow: Dense Correspondence across Different Scenes Lecture Notes in Computer Science. pp. 28- 42 ,(2008) , 10.1007/978-3-540-88690-7_3
Alexander G. Schwing, Raquel Urtasun, Efficient exact inference for 3d indoor scene understanding european conference on computer vision. pp. 299- 313 ,(2012) , 10.1007/978-3-642-33783-3_22
Abhinav Gupta, Alexei A. Efros, Martial Hebert, Blocks World Revisited: Image Understanding Using Qualitative Geometry and Mechanics Computer Vision – ECCV 2010. pp. 482- 496 ,(2010) , 10.1007/978-3-642-15561-1_35
Aude Oliva, Antonio Torralba, Building the gist of a scene: the role of global image features in recognition. Progress in Brain Research. ,vol. 155, pp. 23- 36 ,(2006) , 10.1016/S0079-6123(06)55002-2
Joseph Tighe, Svetlana Lazebnik, Superparsing: scalable nonparametric image parsing with superpixels european conference on computer vision. pp. 352- 365 ,(2010) , 10.1007/978-3-642-15555-0_26
Aude Oliva, Antonio Torralba, Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope International Journal of Computer Vision. ,vol. 42, pp. 145- 175 ,(2001) , 10.1023/A:1011139631724