Learning object class detectors from weakly annotated video

作者: Alessandro Prest , C. Leistner , J. Civera , C. Schmid , V. Ferrari

DOI: 10.1109/CVPR.2012.6248065

关键词:

摘要: Object detectors are typically trained on a large set of still images annotated by bounding-boxes. This paper introduces an approach for learning object from real-world web videos known only to contain objects target class. We propose fully automatic pipeline that localizes in the class and learns detector it. The extracts candidate spatio-temporal tubes based motion segmentation then selects one tube per video jointly over all videos. To compare state art, we test our images, i.e., Pascal VOC 2007. observe frames extracted can differ significantly terms quality taken good camera. Thus, formulate as domain adaptation task. show training combination weakly using improves performance alone.

参考文章(36)
Thomas Brox, Jitendra Malik, None, Object segmentation by long term analysis of point trajectories european conference on computer vision. pp. 282- 295 ,(2010) , 10.1007/978-3-642-15555-0_21
David J. Crandall, Daniel P. Huttenlocher, Weakly supervised learning of part-based spatial models for visual object recognition european conference on computer vision. pp. 16- 29 ,(2006) , 10.1007/11744023_2
Aude Oliva, Antonio Torralba, Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope International Journal of Computer Vision. ,vol. 42, pp. 145- 175 ,(2001) , 10.1023/A:1011139631724
Herbert Bay, Tinne Tuytelaars, Luc Van Gool, SURF: speeded up robust features european conference on computer vision. ,vol. 1, pp. 404- 417 ,(2006) , 10.1007/11744023_32
Thomas Deselaers, Bogdan Alexe, Vittorio Ferrari, Localizing Objects While Learning Their Appearance Computer Vision – ECCV 2010. pp. 452- 466 ,(2010) , 10.1007/978-3-642-15561-1_33
Karim All, David Hasler, Frangois Fleuret, FlowBoost — Appearance learning from sparsely annotated video computer vision and pattern recognition. pp. 1433- 1440 ,(2011) , 10.1109/CVPR.2011.5995403
Yusuf Aytar, Andrew Zisserman, Tabula rasa: Model transfer for object category detection international conference on computer vision. pp. 2252- 2259 ,(2011) , 10.1109/ICCV.2011.6126504
Sudheendra Vijayanarasimhan, Kristen Grauman, Large-scale live active learning: Training object detectors with crawled data and crowds computer vision and pattern recognition. pp. 1449- 1456 ,(2011) , 10.1109/CVPR.2011.5995430
Antonio Torralba, Alexei A. Efros, Unbiased look at dataset bias computer vision and pattern recognition. pp. 1521- 1528 ,(2011) , 10.1109/CVPR.2011.5995347
Himanshu Arora, Nicolas Loeff, David A. Forsyth, Narendra Ahuja, Unsupervised Segmentation of Objects using Efficient Learning computer vision and pattern recognition. pp. 1- 7 ,(2007) , 10.1109/CVPR.2007.383011