Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

作者: Alan L. Yuille , Liang-Chieh Chen , Iasonas Kokkinos , Kevin Murphy , George Papandreou

DOI:

关键词:

摘要: Deep Convolutional Neural Networks (DCNNs) have recently shown state of the art performance in high level vision tasks, such as image classification and object detection. This work brings together methods from DCNNs probabilistic graphical models for addressing task pixel-level (also called "semantic segmentation"). We show that responses at final layer are not sufficiently localized accurate segmentation. is due to very invariance properties make good tasks. overcome this poor localization property deep networks by combining DCNN with a fully connected Conditional Random Field (CRF). Qualitatively, our "DeepLab" system able localize segment boundaries accuracy which beyond previous methods. Quantitatively, method sets new state-of-art PASCAL VOC-2012 semantic segmentation task, reaching 71.6% IOU test set. how these results can be obtained efficiently: Careful network re-purposing novel application 'hole' algorithm wavelet community allow dense computation neural net 8 frames per second on modern GPU.

参考文章(41)
João Carreira, Rui Caseiro, Jorge Batista, Cristian Sminchisescu, Semantic Segmentation with Second-Order Pooling Computer Vision – ECCV 2012. pp. 430- 443 ,(2012) , 10.1007/978-3-642-33786-4_32
Pierre Sermanet, Yann LeCun, David Eigen, Rob Fergus, Michael Mathieu, Xiang Zhang, OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks arXiv: Computer Vision and Pattern Recognition. ,(2013)
Alan L. Yuille, Liang-Chieh Chen, Kevin Murphy, George Papandreou, Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation arXiv: Computer Vision and Pattern Recognition. ,(2015)
Davi Geiger, Alan Yuille, A common framework for image segmentation international conference on pattern recognition. ,vol. 6, pp. 227- 243 ,(1990) , 10.1007/BF00115697
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Iasonas Kokkinos, George Papandreou, Pierre-André Savalle, Untangling Local and Global Deformations in Deep Convolutional Networks for Image Classification and Sliding Window Detection arXiv: Computer Vision and Pattern Recognition. ,(2014)
Matthew D. Zeiler, Rob Fergus, Visualizing and Understanding Convolutional Networks european conference on computer vision. pp. 818- 833 ,(2014) , 10.1007/978-3-319-10590-1_53
Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan Yuille, Towards unified depth and semantic prediction from a single image computer vision and pattern recognition. pp. 2800- 2809 ,(2015) , 10.1109/CVPR.2015.7298897
Dhruv Batra, Michael Cogswell, Senthil Purushwalkam, Xiao Lin, Combining the Best of Graphical Models and ConvNets for Semantic Segmentation arXiv: Computer Vision and Pattern Recognition. ,(2014)
Bharath Hariharan, Pablo Arbelaez, Ross Girshick, Jitendra Malik, Hypercolumns for object segmentation and fine-grained localization computer vision and pattern recognition. pp. 447- 456 ,(2015) , 10.1109/CVPR.2015.7298642