Learning to segment with image-level annotations

作者: Yunchao Wei , Xiaodan Liang , Yunpeng Chen , Zequn Jie , Yanhui Xiao

DOI: 10.1016/J.PATCOG.2016.01.015

关键词:

摘要: Recently, deep convolutional neural networks (DCNNs) have significantly promoted the development of semantic image segmentation. However, previous works on learning segmentation network often rely a large number ground-truths with pixel-level annotations, which usually require considerable human effort. In this paper, we explore more challenging problem by to segment under image-level annotations. Specifically, our framework consists two components. First, reliable hypotheses based localization maps are generated incorporating hypotheses-aware classification and cross-image contextual refinement. Second, can be trained in supervised manner these maps. We training strategies for achieving good performance. For first strategy, novel multi-label cross-entropy loss is proposed train directly using multiple all classes, where each pixel contributes class different weights. second rough mask inferred from maps, then optimized single-label produced masks. evaluate methods PASCAL VOC 2012 benchmark. Extensive experimental results demonstrate effectiveness compared state-of-the-arts. HighlightsLocalization map generation hypothesis-based classification.A maps.An effective method predict given image.Our achieve new state-of-the-art

参考文章(41)
Jifeng Dai, Kaiming He, Jian Sun, BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation international conference on computer vision. pp. 1635- 1643 ,(2015) , 10.1109/ICCV.2015.191
Alan L. Yuille, Liang-Chieh Chen, Kevin Murphy, George Papandreou, Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation arXiv: Computer Vision and Pattern Recognition. ,(2015)
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Deepak Pathak, Philipp Krahenbuhl, Trevor Darrell, Constrained Convolutional Neural Networks for Weakly Supervised Segmentation international conference on computer vision. pp. 1796- 1804 ,(2015) , 10.1109/ICCV.2015.209
Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation computer vision and pattern recognition. pp. 3431- 3440 ,(2015) , 10.1109/CVPR.2015.7298965
Jiasen Lu, Ran Xu, Jason J. Corso, Human action segmentation with hierarchical supervoxel consistency computer vision and pattern recognition. pp. 3762- 3771 ,(2015) , 10.1109/CVPR.2015.7299000
Jifeng Dai, Kaiming He, Jian Sun, Convolutional feature masking for joint object and stuff segmentation computer vision and pattern recognition. pp. 3992- 4000 ,(2015) , 10.1109/CVPR.2015.7299025
Alan L. Yuille, Liang-Chieh Chen, Iasonas Kokkinos, Kevin Murphy, George Papandreou, Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs arXiv: Computer Vision and Pattern Recognition. ,(2014)
Jia Xu, Alexander G. Schwing, Raquel Urtasun, Learning to segment under various forms of weak supervision computer vision and pattern recognition. pp. 3781- 3790 ,(2015) , 10.1109/CVPR.2015.7299002
Evan Shelhamer, Jonathan Long, Deepak Pathak, Trevor Darrell, Fully Convolutional Multi-Class Multiple Instance Learning arXiv: Computer Vision and Pattern Recognition. ,(2014)