Deep Human Parsing with Active Template Regression

Authors: Xiaodan Liang, Si Liu, Xiaohui Shen, Jianchao Yang, Luoqi Liu

DOI: 10.1109/TPAMI.2015.2408360

Keywords: Parsing, Artificial intelligence, Normalization (statistics), Pixel, Artificial neural network, Smoothing, Computer science, Body region, Convolutional neural network, Feature extraction, Shape analysis (digital geometry), Pattern recognition

Abstract: In this work, the human parsing task, namely decomposing a human image into semantic fashion/body regions, is formulated as an active template regression (ATR) problem. The normalized mask of each fashion/body item is expressed as a linear combination of learned mask templates and then morphed into a more precise mask using the active shape parameters, which include the position, scale, and visibility of each semantic region. The mask template coefficients and the active shape parameters together can generate the human …
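The composition step described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not say how the coefficients and shape parameters are predicted or how the morphing is computed, so the function names (`combine_templates`, `morph_mask`), the 0.5 threshold, the nearest-neighbour resizing, and the toy templates are all assumptions made for illustration. The sketch only shows how a normalized mask might be built as a linear combination of templates and then placed into the image using position, scale, and visibility.

```python
from typing import List, Sequence, Tuple

Mask = List[List[float]]


def combine_templates(templates: Sequence[Mask], coeffs: Sequence[float]) -> Mask:
    """Build a normalized mask as a linear combination of mask templates."""
    rows, cols = len(templates[0]), len(templates[0][0])
    mask = [[0.0] * cols for _ in range(rows)]
    for template, coeff in zip(templates, coeffs):
        for r in range(rows):
            for c in range(cols):
                mask[r][c] += coeff * template[r][c]
    return mask


def morph_mask(
    norm_mask: Mask,
    image_size: Tuple[int, int],
    center: Tuple[int, int],
    size: Tuple[int, int],
    visible: bool,
    threshold: float = 0.5,  # assumed binarization threshold
) -> List[List[int]]:
    """Place a normalized mask into an image canvas.

    center  -- (row, col) position of the region in the image
    size    -- (height, width) of the region in pixels, i.e. its scale
    visible -- if False, the region contributes no pixels
    """
    height, width = image_size
    canvas = [[0] * width for _ in range(height)]
    if not visible:
        return canvas
    out_h, out_w = size
    top = center[0] - out_h // 2
    left = center[1] - out_w // 2
    n_rows, n_cols = len(norm_mask), len(norm_mask[0])
    for i in range(out_h):
        for j in range(out_w):
            y, x = top + i, left + j
            if 0 <= y < height and 0 <= x < width:
                # Nearest-neighbour lookup into the normalized mask (assumption).
                src_r = min(n_rows - 1, i * n_rows // out_h)
                src_c = min(n_cols - 1, j * n_cols // out_w)
                if norm_mask[src_r][src_c] >= threshold:
                    canvas[y][x] = 1
    return canvas


# Toy example with two hypothetical 2x2 templates.
t1 = [[1.0, 0.0], [0.0, 0.0]]
t2 = [[0.0, 0.0], [0.0, 1.0]]
normalized = combine_templates([t1, t2], [1.0, 0.2])
region = morph_mask(normalized, image_size=(6, 6), center=(3, 3), size=(4, 4), visible=True)
for row in region:
    print(row)
```

In this toy setting the visibility flag simply switches the region on or off, while position and scale control where the normalized mask lands in the image and how large it is.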


