High-Resolution Semantic Labeling with Convolutional Neural Networks

作者: Emmanuel Maggiori , Yuliya Tarabalka , Guillaume Charpiat , Pierre Alliez

DOI: 10.1109/TGRS.2017.2740362

关键词: Semantic labelingCategorizationMachine learningHigh resolutionPattern recognitionIdeal (set theory)Aerial imageImage (mathematics)Convolutional neural networkArtificial intelligenceComputer science

摘要: Convolutional neural networks (CNNs) have received increasing attention over the last few years. They were initially conceived for image categorization, i.e., problem of assigning a semantic label to an entire input image. In this paper we address dense labeling, which consists in every pixel Since requires high spatial accuracy determine where labels are assigned, categorization CNNs, intended be highly robust local deformations, not directly applicable. By adapting networks, many labeling CNNs been recently proposed. Our first contribution is in-depth analysis these architectures. We establish desired properties ideal CNN, and assess how those methods stand with regard properties. observe that even though they provide competitive results, often underexploit could lead more effective efficient Out observations, then derive CNN framework specifically adapted problem. In addition learning features at different resolutions, it learns combine features. By integrating global information flexible manner, outperforms previous techniques. evaluate proposed compare state-of-the-art architectures on public benchmarks high-resolution aerial labeling.

参考文章(33)
Vijay Badrinarayanan, Roberto Cipolla, Ankur Handa, SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling computer vision and pattern recognition. ,(2015)
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification international conference on computer vision. pp. 1026- 1034 ,(2015) , 10.1109/ICCV.2015.123
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Hyeonwoo Noh, Seunghoon Hong, Bohyung Han, Learning Deconvolution Network for Semantic Segmentation international conference on computer vision. pp. 1520- 1528 ,(2015) , 10.1109/ICCV.2015.178
Yoshua Bengio, Practical recommendations for gradient-based training of deep architectures Neural Networks: Tricks of the Trade (2nd ed.). pp. 437- 478 ,(2012) , 10.1007/978-3-642-35289-8_26
Matthew D. Zeiler, Rob Fergus, Visualizing and Understanding Convolutional Networks european conference on computer vision. pp. 818- 833 ,(2014) , 10.1007/978-3-319-10590-1_53
Jonathan Long, Evan Shelhamer, Trevor Darrell, Fully convolutional networks for semantic segmentation computer vision and pattern recognition. pp. 3431- 3440 ,(2015) , 10.1109/CVPR.2015.7298965
Sakrapee Paisitkriangkrai, Jamie Sherrah, Pranam Janney, Anton Van-Den Hengel, Effective semantic pixel labelling with convolutional networks and Conditional Random Fields computer vision and pattern recognition. pp. 36- 43 ,(2015) , 10.1109/CVPRW.2015.7301381
Clement Farabet, Camille Couprie, Laurent Najman, Yann LeCun, Learning Hierarchical Features for Scene Labeling IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 35, pp. 1915- 1929 ,(2013) , 10.1109/TPAMI.2012.231