Learning to be a depth camera for close-range human capture and interaction

作者: Sean Ryan Fanello , Cem Keskin , Shahram Izadi , Pushmeet Kohli , David Kim

DOI: 10.1145/2601097.2601223

关键词:

摘要: We present a machine learning technique for estimating absolute, per-pixel depth using any conventional monocular 2D camera, with minor hardware modifications. Our approach targets close-range human capture and interaction where dense 3D estimation of hands and faces is desired. We use hybrid classification-regression forests to learn how to map from near infrared intensity images to absolute, metric depth in real-time. We demonstrate a variety of human-computer interaction and capture scenarios. Experiments show an …

参考文章(51)
Oliver Vogel, Michael Breuß, Thomas Leichtweis, Joachim Weickert, Fast Shape from Shading for Phong-Type Surfaces international conference on scale space and variational methods in computer vision. pp. 733- 744 ,(2009) , 10.1007/978-3-642-02256-2_61
Sabine Süsstrunk, Clément Fredembach, Colouring the near infrared color imaging conference. pp. 176- 182 ,(2008)
Cem Keskin, Furkan Kıraç, Yunus Emre Kara, Lale Akarun, Hand pose estimation and hand shape classification using multi-layered randomized decision forests european conference on computer vision. pp. 852- 863 ,(2012) , 10.1007/978-3-642-33783-3_61
Bernhard Schölkopf, Peter Gehler, Carsten Rother, Lumin Zhang, Martin Kiefel, Recovering Intrinsic Images with a Global Sparsity Prior on Reflectance Untitled Event. pp. 765- 773 ,(2011)
Jamie Shotton, John Winn, Carsten Rother, Antonio Criminisi, TextonBoost : joint appearance, shape and context modeling for multi-class object recognition and segmentation european conference on computer vision. ,vol. 1, pp. 1- 15 ,(2006) , 10.1007/11744023_1
Kevin Karsch, Ce Liu, Sing Bing Kang, Depth Extraction from Video Using Non-parametric Sampling Computer Vision – ECCV 2012. pp. 775- 788 ,(2012) , 10.1007/978-3-642-33715-4_56
M.Z. Brown, D. Burschka, G.D. Hager, Advances in computational stereo IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 25, pp. 993- 1008 ,(2003) , 10.1109/TPAMI.2003.1217603
Richard A. Newcombe, Andrew Fitzgibbon, Shahram Izadi, Otmar Hilliges, David Molyneaux, David Kim, Andrew J. Davison, Pushmeet Kohi, Jamie Shotton, Steve Hodges, KinectFusion: Real-time dense surface mapping and tracking international symposium on mixed and augmented reality. pp. 127- 136 ,(2011) , 10.1109/ISMAR.2011.6092378
Dilip Krishnan, Rob Fergus, Dark flash photography international conference on computer graphics and interactive techniques. ,vol. 28, pp. 96- ,(2009) , 10.1145/1531326.1531402