Deep Human Parsing with Active Template Regression

Authors: Xiaodan Liang, Si Liu, Xiaohui Shen, Jianchao Yang, Luoqi Liu

DOI: 10.1109/TPAMI.2015.2408360

Keywords: Parsing, Artificial intelligence, Normalization (statistics), Pixel, Artificial neural network, Smoothing, Computer science, Body region, Convolutional neural network, Feature extraction, Shape analysis (digital geometry), Pattern recognition

Abstract: In this work, the human parsing task, namely decomposing a human image into semantic fashion/body regions, is formulated as an active template regression (ATR) problem, where the normalized mask of each fashion/body item is expressed as a linear combination of learned mask templates and then morphed into a more precise mask using the active shape parameters, which include the position, scale and visibility of each semantic region. The mask template coefficients and the active shape parameters together can generate the human …
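
The two steps named in the abstract (a normalized mask built as a linear combination of templates, then placed using position, scale and visibility) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `reconstruct_mask` and `morph_mask`, the box-style parameterization of position and scale, the nearest-neighbour resize, and the 0.5 threshold are all assumptions made for illustration, and the templates and parameters are simply given here rather than learned or predicted.

```python
import numpy as np


def reconstruct_mask(templates, coeffs):
    """Normalized mask as a linear combination of mask templates.

    templates: array of shape (K, h, w), one template per row (assumed given).
    coeffs:    array of shape (K,), the template coefficients (assumed given).
    """
    return np.tensordot(coeffs, templates, axes=1)  # shape (h, w)


def morph_mask(norm_mask, center, size, visible, image_shape, threshold=0.5):
    """Place a normalized mask on the image canvas.

    center:  (row, col) of the region centre (hypothetical "position" encoding).
    size:    (height, width) of the region in pixels (hypothetical "scale" encoding).
    visible: whether the region is present at all.
    """
    H, W = image_shape
    out = np.zeros((H, W), dtype=bool)
    if not visible:
        return out  # an invisible region contributes no pixels

    box_h, box_w = size
    top = int(round(center[0] - box_h / 2))
    left = int(round(center[1] - box_w / 2))

    # Nearest-neighbour resize of the normalized mask to the target box.
    h, w = norm_mask.shape
    rows = np.arange(box_h) * h // box_h
    cols = np.arange(box_w) * w // box_w
    resized = norm_mask[np.ix_(rows, cols)] > threshold

    # Clip the box to the image bounds before pasting.
    y0, x0 = max(top, 0), max(left, 0)
    y1, x1 = min(top + box_h, H), min(left + box_w, W)
    if y0 >= y1 or x0 >= x1:
        return out
    out[y0:y1, x0:x1] = resized[y0 - top:y1 - top, x0 - left:x1 - left]
    return out


# Toy usage: two 4x4 templates, one fully "on" and one fully "off".
templates = np.stack([np.ones((4, 4)), np.zeros((4, 4))])
coeffs = np.array([0.8, 0.2])
norm_mask = reconstruct_mask(templates, coeffs)
region = morph_mask(norm_mask, center=(5, 5), size=(4, 4),
                    visible=True, image_shape=(10, 10))
```

In this toy setup each semantic region would get its own coefficients and shape parameters, and the per-region masks would then be combined into a full parsing result; how that combination is done is not stated in the truncated abstract and is left out here.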

References (31)
João Carreira, Rui Caseiro, Jorge Batista, Cristian Sminchisescu. Semantic Segmentation with Second-Order Pooling. Computer Vision – ECCV 2012, pp. 430-443 (2012). DOI: 10.1007/978-3-642-33786-4_32
Huizhong Chen, Andrew Gallagher, Bernd Girod. Describing Clothing by Semantic Attributes. Computer Vision – ECCV 2012, pp. 609-623 (2012). DOI: 10.1007/978-3-642-33712-3_44
Ronan Collobert, Pedro Pinheiro. Recurrent Convolutional Neural Networks for Scene Labeling. International Conference on Machine Learning, pp. 82-90 (2014).
Matthew D. Zeiler, Rob Fergus. Visualizing and Understanding Convolutional Networks. European Conference on Computer Vision, pp. 818-833 (2014). DOI: 10.1007/978-3-319-10590-1_53
Daniel D. Lee, H. Sebastian Seung. Learning the parts of objects by non-negative matrix factorization. Nature, vol. 401, pp. 788-791 (1999). DOI: 10.1038/44565
J. Wright, Wenli Xu, Yi Ma, Yigang Peng, A. Ganesh. RASL: Robust Alignment by Sparse and Low-Rank Decomposition for Linearly Correlated Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, pp. 2233-2246 (2012). DOI: 10.1109/TPAMI.2011.282
Liang Lin, Xiaolong Wang, Wei Yang, Jian-Huang Lai. Discriminatively Trained And-Or Graph Models for Object Shape Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, pp. 959-972 (2015). DOI: 10.1109/TPAMI.2014.2359888
Pedro F. Felzenszwalb, Daniel P. Huttenlocher. Efficient Graph-Based Image Segmentation. International Journal of Computer Vision, vol. 59, pp. 167-181 (2004). DOI: 10.1023/B:VISI.0000022288.19776.77
Jian Dong, Qiang Chen, Wei Xia, Zhongyang Huang, Shuicheng Yan. A Deformable Mixture Parsing Model with Parselets. International Conference on Computer Vision, pp. 3408-3415 (2013). DOI: 10.1109/ICCV.2013.423
Clement Farabet, Camille Couprie, Laurent Najman, Yann LeCun. Learning Hierarchical Features for Scene Labeling. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, pp. 1915-1929 (2013). DOI: 10.1109/TPAMI.2012.231