Efficient piecewise training of deep structured models for semantic segmentation

Authors: Ian Reid, Chunhua Shen, Guosheng Lin, Anton van den Hengel

DOI:

Keywords:

Abstract: Recent advances in semantic image segmentation have mostly been achieved by training deep convolutional neural networks (CNNs). We show how to improve semantic segmentation through the use of contextual information; specifically, we explore 'patch-patch' context between image regions, and 'patch-background' context. For learning from the patch-patch context, we formulate Conditional Random Fields (CRFs) with CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Efficient piecewise training of the proposed deep structured model is then applied to avoid repeated expensive CRF inference for back propagation. For capturing the patch-background context, we show that a network design with traditional multi-scale image input and sliding pyramid pooling is effective for improving performance. Our experimental results set new state-of-the-art performance on a number of popular semantic segmentation datasets, including NYUDv2, PASCAL VOC 2012, PASCAL-Context, and SIFT-flow. In particular, we achieve an intersection-over-union score of 78.0 on the challenging PASCAL VOC 2012 dataset.

References (42)
Serge Belongie, C. Lawrence Zitnick, Deva Ramanan, Piotr Dollár, Pietro Perona, James Hays, Michael Maire, Ross Girshick, Lubomir Bourdev, Tsung-Yi Lin. Microsoft COCO: Common Objects in Context. arXiv: Computer Vision and Pattern Recognition (2014).
Christoph Bregler, Yann LeCun, Jonathan Tompson, Arjun Jain. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation. arXiv: Computer Vision and Pattern Recognition (2014).
Hyeonwoo Noh, Seunghoon Hong, Bohyung Han. Learning Deconvolution Network for Semantic Segmentation. arXiv: Computer Vision and Pattern Recognition (2015).
Karel Lenc, Andrea Vedaldi. MatConvNet: Convolutional Neural Networks for MATLAB. arXiv: Computer Vision and Pattern Recognition (2014).
Karen Simonyan, Andrew Zisserman. Very Deep Convolutional Networks for Large-Scale Image Recognition. International Conference on Learning Representations (2015).
Alan L. Yuille, Liang-Chieh Chen, Iasonas Kokkinos, Kevin Murphy, George Papandreou. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs. International Conference on Learning Representations (2015).
Chao Dong, Chen Change Loy, Kaiming He, Xiaoou Tang. Learning a Deep Convolutional Network for Image Super-Resolution. European Conference on Computer Vision, pp. 184-199 (2014). DOI: 10.1007/978-3-319-10593-2_13
João Carreira, Rui Caseiro, Jorge Batista, Cristian Sminchisescu. Semantic Segmentation with Second-Order Pooling. Computer Vision – ECCV 2012, pp. 430-443 (2012). DOI: 10.1007/978-3-642-33786-4_32
Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus. Indoor Segmentation and Support Inference from RGBD Images. Computer Vision – ECCV 2012, pp. 746-760 (2012). DOI: 10.1007/978-3-642-33715-4_54
Carl Doersch, Abhinav Gupta, Alexei A. Efros. Context as Supervisory Signal: Discovering Objects with Predictable Context. European Conference on Computer Vision, pp. 362-377 (2014). DOI: 10.1007/978-3-319-10578-9_24