Deep Adaptive Feature Aggregation in Multi-task Convolutional Neural Networks

Authors: Zhen Shen, Chaoran Cui, Jin Huang, Jian Zong, Meng Chen

DOI: 10.1145/3340531.3412132

Keywords: Construct (python library), Task (project management), Computer science, Layer (object-oriented design), Artificial intelligence, Degree (graph theory), Feature (computer vision), Convolutional neural network, Feature aggregation, Pattern recognition, Multi-task learning

Abstract: Convolutional Neural Network (CNN) based multi-task learning methods have been widely used in a variety of computer vision applications. Towards effective CNN architectures, recent studies automatically learn the optimal combinations of task-specific features at single network layers. However, they generally construct an unchanged feature aggregation operation after training, regardless of the characteristics of the input features. In this paper, we propose a novel Adaptive Feature Aggregation (AFA) layer for CNNs, in which a dynamic mechanism is designed to allow each task to adaptively determine the degree to which features of different tasks are needed, according to their dependencies. On both pixel-level and image-level tasks, we demonstrate that our approach significantly outperforms previous state-of-the-art CNNs.
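
Illustrative sketch (not taken from the paper): the abstract does not spell out how the AFA layer computes its aggregation, so the snippet below only illustrates the general idea of an input-dependent mix of two tasks' features. It assumes a squeeze-and-excitation style gate, in which channel means are passed through a sigmoid, as a stand-in mechanism. The names `channel_means`, `adaptive_aggregate`, `weight`, and `bias` are hypothetical.

```python
import math

def channel_means(feature):
    """Global average pooling: one mean per channel (each channel is a flat list)."""
    return [sum(channel) / len(channel) for channel in feature]

def adaptive_aggregate(own, other, weight, bias):
    """Mix `other` into `own` using a gate computed from the current inputs.

    Assumption: a sigmoid gate over pooled descriptors; this is a stand-in,
    not the mechanism defined in the paper.
    own, other: lists of C channels, each a flat list of activations.
    weight: C rows of length 2C; bias: length C (hypothetical gate parameters).
    """
    descriptor = channel_means(own) + channel_means(other)
    gate = []
    for row, b in zip(weight, bias):
        z = sum(w * d for w, d in zip(row, descriptor)) + b
        gate.append(1.0 / (1.0 + math.exp(-z)))  # per-channel value in (0, 1)
    mixed = [
        [o + g * x for o, x in zip(own_ch, other_ch)]
        for own_ch, other_ch, g in zip(own, other, gate)
    ]
    return mixed, gate

# Two tasks, two channels, two spatial positions per channel.
feat_a = [[1.0, 2.0], [0.0, 0.0]]
feat_b = [[4.0, 4.0], [2.0, -2.0]]
weight = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
bias = [0.0, 0.0]
mixed_a, gate_a = adaptive_aggregate(feat_a, feat_b, weight, bias)
```

Because the gate is recomputed from the pooled features on every call, the degree of mixing changes with the input, in contrast to a fixed combination weight learned once during training.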

References (10)
Nathan Silberman, Derek Hoiem, Pushmeet Kohli, Rob Fergus. Indoor Segmentation and Support Inference from RGBD Images. Computer Vision – ECCV 2012, pp. 746-760 (2012). DOI: 10.1007/978-3-642-33715-4_54
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun. Deep Residual Learning for Image Recognition. Computer Vision and Pattern Recognition, pp. 770-778 (2016). DOI: 10.1109/CVPR.2016.90
Xiang Li, Wenhai Wang, Xiaolin Hu, Jian Yang. Selective Kernel Networks. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 510-519 (2019). DOI: 10.1109/CVPR.2019.00060
Brendan Jou, Shih-Fu Chang. Deep Cross Residual Learning for Multitask Visual Recognition. ACM Multimedia, pp. 998-1007 (2016). DOI: 10.1145/2964284.2964309
Jie Hu, Li Shen, Samuel Albanie, Gang Sun, Enhua Wu. Squeeze-and-Excitation Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, pp. 2011-2023 (2020). DOI: 10.1109/TPAMI.2019.2913372
Iro Laina, Christian Rupprecht, Vasileios Belagiannis, Federico Tombari, Nassir Navab. Deeper Depth Prediction with Fully Convolutional Residual Networks. International Conference on 3D Vision, pp. 239-248 (2016). DOI: 10.1109/3DV.2016.32
Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert. Cross-Stitch Networks for Multi-task Learning. Computer Vision and Pattern Recognition, pp. 3994-4003 (2016). DOI: 10.1109/CVPR.2016.433
Yuan Gao, Jiayi Ma, Mingbo Zhao, Wei Liu, Alan L. Yuille. NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3205-3214 (2019). DOI: 10.1109/CVPR.2019.00332
Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, Hartwig Adam. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. European Conference on Computer Vision, pp. 833-851 (2018). DOI: 10.1007/978-3-030-01234-2_49
Jie Hu, Li Shen, Samuel Albanie, Gang Sun, Enhua Wu. Squeeze-and-Excitation Networks. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018). DOI: 10.1109/CVPR.2018.00745