Putting the User in the Loop for Image-Based Modeling

作者: Adarsh Kowdle , Yao-Jen Chang , Andrew Gallagher , Dhruv Batra , Tsuhan Chen

DOI: 10.1007/S11263-014-0704-X

关键词: Active learning (machine learning)Computer scienceUser modelingPattern recognition (psychology)Graph (abstract data type)Object (computer science)Computer visionUser-in-the-loop3D modelingArtificial intelligenceMarkov random field

摘要: We refer to the task of recovering 3D structure an object or a scene using 2D images as image-based modeling. In this paper, we formulate discrete optimization problem solved via energy minimization. standard framework Markov random field (MRF) defined over image present algorithms that allow user intuitively interact with algorithm. introduce algorithm where guides process modeling find and model interest by manually interacting nodes graph. develop end applications on mobile device printing interest. also propose alternate active learning input. An initial attempt is made at reconstructing without supervision. Given reconstruction, uses intuitive cues quantify uncertainty suggest regions, querying provide support for uncertain regions simple scribbles. These constraints are used update unary pairwise energies that, when solved, lead better reconstructions. show through machine experiments study proposed approach intelligently queries users constraints, achieve reconstructions faster, especially scenes textureless surfaces lacking strong textural structural typically require.

参考文章(52)
Christian Theobalt, Sebastian Thrun, Andrew Y. Ng, Ashutosh Saxena, Savil Srivastava, i23 - Rapid Interactive 3D Reconstruction from a Single Image. vision modeling and visualization. pp. 19- 28 ,(2009)
Bruce Guenther Baumgart, Geometric modeling for computer vision. Stanford University. ,(1974)
Neill D. F. Campbell, George Vogiatzis, Carlos Hernández, Roberto Cipolla, Using Multiple Hypotheses to Improve Depth-Maps for Multi-View Stereo Lecture Notes in Computer Science. pp. 766- 779 ,(2008) , 10.1007/978-3-540-88682-2_58
Adarsh Kowdle, Sudipta N. Sinha, Richard Szeliski, Multiple View Object Cosegmentation Using Appearance and Stereo Cues Computer Vision – ECCV 2012. pp. 789- 803 ,(2012) , 10.1007/978-3-642-33715-4_57
Adarsh Kowdle, Dhruv Batra, Wen-Chao Chen, Tsuhan Chen, iModel: interactive co-segmentation for object of interest 3d modeling european conference on computer vision. pp. 211- 224 ,(2010) , 10.1007/978-3-642-35740-4_17
Brendan Collins, Jia Deng, Kai Li, Li Fei-Fei, Towards Scalable Dataset Construction: An Active Learning Approach Lecture Notes in Computer Science. pp. 86- 98 ,(2008) , 10.1007/978-3-540-88682-2_8
Keith Forbes, Fred Nicolls, Gerhard de Jager, Anthon Voigt, Shape-from-Silhouette with two mirrors and an uncalibrated camera european conference on computer vision. pp. 165- 178 ,(2006) , 10.1007/11744047_13
Paul E. Debevec, Camillo J. Taylor, Jitendra Malik, Modeling and rendering architecture from photographs: a hybrid geometry- and image-based approach international conference on computer graphics and interactive techniques. pp. 11- 20 ,(1996) , 10.1145/237170.237191
Xiang Sean Zhou, Thomas S. Huang, Relevance feedback in image retrieval: A comprehensive review Multimedia Systems. ,vol. 8, pp. 536- 544 ,(2003) , 10.1007/S00530-002-0070-3
Pedro F. Felzenszwalb, Daniel P. Huttenlocher, Efficient Graph-Based Image Segmentation International Journal of Computer Vision. ,vol. 59, pp. 167- 181 ,(2004) , 10.1023/B:VISI.0000022288.19776.77