Active learning for semantic segmentation with expected change

作者: A. Vezhnevets , J. M. Buhmann , V. Ferrari

DOI: 10.1109/CVPR.2012.6248050

关键词:

摘要: We address the problem of semantic segmentation: classifying each pixel in an image according to class it belongs (e.g. dog, road, car). Most existing methods train from fully supervised images, where is annotated by a label. To reduce annotation effort, recently few weakly approaches emerged. These require only labels indicating which classes are present. Although their performance reaches satisfactory level, there still substantial gap between accuracy and methods. this with novel active learning method specifically suited for setting. model as pairwise CRF cast finding its most informative nodes. nodes induce largest expected change overall state, after revealing true Our criterion equivalent maximizing upper-bound on gain. Experiments two data-sets show that our achieves 97% percent corresponding model, while querying less than 17% (super-)pixel labels.

参考文章(31)
Carlos Guestrin, Andreas Krause, Near-optimal nonmyopic value of information in graphical models uncertainty in artificial intelligence. pp. 324- 331 ,(2005)
Jamie Shotton, John Winn, Carsten Rother, Antonio Criminisi, TextonBoost : joint appearance, shape and context modeling for multi-class object recognition and segmentation european conference on computer vision. ,vol. 1, pp. 1- 15 ,(2006) , 10.1007/11744023_1
Lubor Ladicky, Chris Russell, Pushmeet Kohli, Philip H. S. Torr, Graph cut based inference with co-occurrence statistics european conference on computer vision. pp. 239- 253 ,(2010) , 10.1007/978-3-642-15555-0_18
Mikhail Belkin, Partha Niyogi, Semi-Supervised Learning on Riemannian Manifolds Machine Learning. ,vol. 56, pp. 209- 239 ,(2004) , 10.1023/B:MACH.0000033120.25363.1E
Richard M. Karp, Reducibility Among Combinatorial Problems Journal of Symbolic Logic. ,vol. 40, pp. 219- 241 ,(2010) , 10.1007/978-3-540-68279-0_8
Behjat Siddiquie, Abhinav Gupta, Beyond active noun tagging: Modeling contextual interactions for multi-class active learning computer vision and pattern recognition. pp. 2979- 2986 ,(2010) , 10.1109/CVPR.2010.5540044
Sudheendra Vijayanarasimhan, Kristen Grauman, Large-scale live active learning: Training object detectors with crawled data and crowds computer vision and pattern recognition. pp. 1449- 1456 ,(2011) , 10.1109/CVPR.2011.5995430
Alexander Vezhnevets, Joachim M. Buhmann, Towards weakly supervised semantic segmentation by means of multiple instance and multitask learning computer vision and pattern recognition. pp. 3249- 3256 ,(2010) , 10.1109/CVPR.2010.5540060
Jamie Shotton, Matthew Johnson, Roberto Cipolla, Semantic texton forests for image categorization and segmentation computer vision and pattern recognition. pp. 1- 8 ,(2008) , 10.1109/CVPR.2008.4587503
V. Kolmogorov, R. Zabih, What energy functions can be minimized via graph cuts IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 26, pp. 147- 159 ,(2004) , 10.1109/TPAMI.2004.1262177