Convolutional feature masking for joint object and stuff segmentation

作者: Jifeng Dai , Kaiming He , Jian Sun

DOI: 10.1109/CVPR.2015.7299025

关键词:

摘要: The topic of semantic segmentation has witnessed considerable progress due to the powerful features learned by convolutional neural networks (CNNs) [13]. current leading approaches for exploit shape information extracting CNN from masked image regions. This strategy introduces artificial boundaries on images and may impact quality extracted features. Besides, operations raw domain require compute thousands a single image, which is time-consuming. In this paper, we propose via masking proposal segments (e.g., super-pixels) are treated as masks feature maps. directly out these maps used train classifiers recognition. We further joint method handle objects “stuff” grass, sky, water) in same framework. State-of-the-art results demonstrated benchmarks PASCAL VOC new PASCAL-CONTEXT, with compelling computational speed.

参考文章(25)
João Carreira, Rui Caseiro, Jorge Batista, Cristian Sminchisescu, Semantic Segmentation with Second-Order Pooling Computer Vision – ECCV 2012. pp. 430- 443 ,(2012) , 10.1007/978-3-642-33786-4_32
Ross Girshick, Jitendra Malik, Bharath Hariharan, Pablo Arbeláez, Simultaneous Detection and Segmentation european conference on computer vision. pp. 297- 312 ,(2014) , 10.1007/978-3-319-10584-0_20
Jamie Shotton, John Winn, Carsten Rother, Antonio Criminisi, TextonBoost : joint appearance, shape and context modeling for multi-class object recognition and segmentation european conference on computer vision. ,vol. 1, pp. 1- 15 ,(2006) , 10.1007/11744023_1
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation computer vision and pattern recognition. pp. 3431- 3440 ,(2015) , 10.1109/CVPR.2015.7298965
Pablo Arbelaez, Jordi Pont-Tuset, Jon Barron, Ferran Marques, Jitendra Malik, Multiscale Combinatorial Grouping computer vision and pattern recognition. pp. 328- 335 ,(2014) , 10.1109/CVPR.2014.49
Vladimir Bychkovsky, Sylvain Paris, Eric Chan, Fredo Durand, Learning photographic global tonal adjustment with a database of input / output image pairs computer vision and pattern recognition. pp. 97- 104 ,(2011) , 10.1109/CVPR.2011.5995332
Mark Everingham, Luc Van Gool, Christopher K. I. Williams, John Winn, Andrew Zisserman, The Pascal Visual Object Classes (VOC) Challenge International Journal of Computer Vision. ,vol. 88, pp. 303- 338 ,(2010) , 10.1007/S11263-009-0275-4
Thomas Brox, Lubomir Bourdev, Subhransu Maji, Jitendra Malik, None, Object segmentation by alignment of poselet activations to image contours CVPR 2011. pp. 2225- 2232 ,(2011) , 10.1109/CVPR.2011.5995659
Yi Yang, Sam Hallman, Deva Ramanan, Charless Fowlkes, Layered object detection for multi-class segmentation computer vision and pattern recognition. pp. 3113- 3120 ,(2010) , 10.1109/CVPR.2010.5540070