Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation

作者: Golnaz Ghiasi , Charless C. Fowlkes

DOI: 10.1007/978-3-319-46487-9_32

关键词:

摘要: CNN architectures have terrific recognition performance but rely on spatial pooling which makes it difficult to adapt them tasks that require dense, pixel-accurate labeling. This paper two contributions: (1) We demonstrate while the apparent resolution of convolutional feature maps is low, high-dimensional representation contains significant sub-pixel localization information. (2) describe a multi-resolution reconstruction architecture based Laplacian pyramid uses skip connections from higher and multiplicative gating successively refine segment boundaries reconstructed lower-resolution maps. approach yields state-of-the-art semantic segmentation results PASCAL VOC Cityscapes benchmarks without resorting more complex random-field inference or instance detection driven architectures.

参考文章(35)
Songfan Yang, Deva Ramanan, Multi-scale Recognition with DAG-CNNs international conference on computer vision. pp. 1215- 1223 ,(2015) , 10.1109/ICCV.2015.144
Arthur Szlam, Emily Denton, Rob Fergus, Soumith Chintala, Deep generative image models using a Laplacian pyramid of adversarial networks neural information processing systems. ,vol. 28, pp. 1486- 1494 ,(2015)
Saining Xie, Zhuowen Tu, Holistically-Nested Edge Detection international conference on computer vision. pp. 1395- 1403 ,(2015) , 10.1109/ICCV.2015.164
Jifeng Dai, Kaiming He, Jian Sun, BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation international conference on computer vision. pp. 1635- 1643 ,(2015) , 10.1109/ICCV.2015.191
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Hyeonwoo Noh, Seunghoon Hong, Bohyung Han, Learning Deconvolution Network for Semantic Segmentation international conference on computer vision. pp. 1520- 1528 ,(2015) , 10.1109/ICCV.2015.178
Matthew D. Zeiler, Rob Fergus, Visualizing and Understanding Convolutional Networks european conference on computer vision. pp. 818- 833 ,(2014) , 10.1007/978-3-319-10590-1_53
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, C. Lawrence Zitnick, Microsoft COCO: Common Objects in Context Computer Vision – ECCV 2014. pp. 740- 755 ,(2014) , 10.1007/978-3-319-10602-1_48
Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation computer vision and pattern recognition. pp. 3431- 3440 ,(2015) , 10.1109/CVPR.2015.7298965
Spyros Gidaris, Nikos Komodakis, Object Detection via a Multi-region and Semantic Segmentation-Aware CNN Model international conference on computer vision. pp. 1134- 1142 ,(2015) , 10.1109/ICCV.2015.135