BLAN: Bi-directional ladder attentive network for facial attribute prediction

作者: Xin Zheng , Huaibo Huang , Yanqing Guo , Bo Wang , Ran He

DOI: 10.1016/J.PATCOG.2019.107155

关键词: Computer scienceArtificial intelligenceHierarchyMutual informationAutoencoderPattern recognitionResidual

摘要: Abstract Deep facial attribute prediction has received considerable attention with a wide range of real-world applications in the past few years. Existing works almost extract abstract global features at high levels deep neural networks to make predictions. However, local low levels, which contain detailed information, are not well exploited. In this paper, we propose novel Bi-directional Ladder Attentive Network (BLAN) learn hierarchical representations, covering correlations between feature hierarchies and characteristics. BLAN adopts layer-wise bi-directional connections based on autoencoder framework from levels. way, characteristics could be correspondingly interweaved each level via multiple designed Residual Dual Attention Modules (RDAMs). Besides, derive Local Mutual Information Maximization (LMIM) loss further incorporate locality attributes high-level representations hierarchy. Multiple classifiers receive produce decisions, followed by proposed adaptive score fusion module merge these decisions for yielding final result. Extensive experiments two datasets, CelebA LFWA, demonstrate that our outperforms state-of-the-art methods.

参考文章(42)
Harri Valpola, Tapani Raiko, Antti Rasmus, Denoising autoencoder with modulated lateral connections learns invariant representations of natural images arXiv: Neural and Evolutionary Computing. ,(2014)
Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang, Deep Learning Face Attributes in the Wild 2015 IEEE International Conference on Computer Vision (ICCV). pp. 3730- 3738 ,(2015) , 10.1109/ICCV.2015.425
Panagiotis Perakis, Theoharis Theoharis, Ioannis A. Kakadiaris, Feature fusion for facial landmark detection Pattern Recognition. ,vol. 47, pp. 2783- 2793 ,(2014) , 10.1016/J.PATCOG.2014.03.007
Naftali Tishby, Noam Slonim, Agglomerative Information Bottleneck neural information processing systems. ,vol. 12, pp. 617- 623 ,(1999)
Aapo Hyvärinen, Michael U. Gutmann, Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics Journal of Machine Learning Research. ,vol. 13, pp. 307- 361 ,(2012) , 10.5555/2188385.2188396
Xiaoou Tang, Xiaogang Wang, Yi Sun, Yuheng Chen, Deep Learning Face Representation by Joint Identification-Verification neural information processing systems. ,vol. 27, pp. 1988- 1996 ,(2014)
Ning Zhang, Manohar Paluri, Marc'Aurelio Ranzato, Trevor Darrell, Lubomir Bourdev, PANDA: Pose Aligned Networks for Deep Attribute Modeling computer vision and pattern recognition. pp. 1637- 1644 ,(2014) , 10.1109/CVPR.2014.212
Aapo Hyvärinen, Michael Gutmann, Noise-contrastive estimation: A new estimation principle for unnormalized statistical models international conference on artificial intelligence and statistics. pp. 297- 304 ,(2010)
Hanchuan Peng, Fuhui Long, C. Ding, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 27, pp. 1226- 1238 ,(2005) , 10.1109/TPAMI.2005.159
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition computer vision and pattern recognition. pp. 770- 778 ,(2016) , 10.1109/CVPR.2016.90