Three dimensional representation and reasoning for indoor scene understanding

Authors: Takeo Kanade, David C. Lee

Abstract: When addressing the problem of scene understanding from a single image, we want our system to understand not only where objects are in the image but also where they are in the 3D world. Segmenting and labeling regions in the 2D image plane does not achieve this goal. We need a representation that inherently encodes the 3D properties of the scene. In addition to the location in 3D, we can make use of physical knowledge about valid configurations of the world by rejecting configurations that violate physical constraints, such as two objects occupying the same volume. The geometric representation can also aid in detecting and identifying certain classes of objects that are well characterized by their geometry. In this thesis, we will demonstrate the benefits of using a 3D geometric representation for indoor scene understanding. We will show that 3D geometric models provide a natural way to represent and inject the physical knowledge that we have and to perform 3D reasoning.
