Discriminative Deep Face Shape Model for Facial Point Detection

作者: Yue Wu , Qiang Ji

DOI: 10.1007/S11263-014-0775-8

关键词:

摘要: Facial point detection is an active area in computer vision due to its relevance many applications. It a nontrivial task, since facial shapes vary significantly with expressions, poses or occlusion. In this paper, we address problem by proposing discriminative deep face shape model that constructed based on augmented factorized three-way Restricted Boltzmann Machines model. Specifically, the combines top-down information from embedded patterns and bottom up measurements local detectors unified framework. addition, along model, effective algorithms are proposed perform learning infer true locations their measurements. Based 68 points detected images both controlled "in-the-wild" conditions. Experiments benchmark data sets show effectiveness of algorithm against state-of-the-art methods.

参考文章(37)
Geoffrey E. Hinton, Ruslan Salakhutdinov, Deep Boltzmann machines international conference on artificial intelligence and statistics. pp. 448- 455 ,(2009)
Vuong Le, Jonathan Brandt, Zhe Lin, Lubomir Bourdev, Thomas S. Huang, Interactive Facial Feature Localization Computer Vision – ECCV 2012. pp. 679- 692 ,(2012) , 10.1007/978-3-642-33712-3_49
Peter N. Belhumeur, David W. Jacobs, David J. Kriegman, Neeraj Kumar, Localizing Parts of Faces Using a Consensus of Exemplars IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 35, pp. 2930- 2940 ,(2013) , 10.1109/TPAMI.2013.23
Alex J. Smola, Bernhard Schölkopf, A tutorial on support vector regression Statistics and Computing. ,vol. 14, pp. 199- 222 ,(2004) , 10.1023/B:STCO.0000035301.49549.88
Yi Sun, Xiaogang Wang, Xiaoou Tang, Deep Convolutional Network Cascade for Facial Point Detection computer vision and pattern recognition. pp. 3476- 3483 ,(2013) , 10.1109/CVPR.2013.446
Abdel-rahman Mohamed, George E. Dahl, Geoffrey Hinton, Acoustic Modeling Using Deep Belief Networks IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 20, pp. 14- 22 ,(2012) , 10.1109/TASL.2011.2109382
Christos Sagonas, Georgios Tzimiropoulos, Stefanos Zafeiriou, Maja Pantic, A Semi-automatic Methodology for Facial Landmark Annotation computer vision and pattern recognition. pp. 896- 903 ,(2013) , 10.1109/CVPRW.2013.132
Graham W. Taylor, Leonid Sigal, David J. Fleet, Geoffrey E. Hinton, Dynamical binary latent variable models for 3D human pose tracking computer vision and pattern recognition. pp. 631- 638 ,(2010) , 10.1109/CVPR.2010.5540157
Simon Baker, Iain Matthews, Lucas-Kanade 20 Years On: A Unifying Framework International Journal of Computer Vision. ,vol. 56, pp. 221- 255 ,(2004) , 10.1023/B:VISI.0000011205.11775.FD
Georgios Tzimiropoulos, Maja Pantic, Optimization Problems for Fast AAM Fitting in-the-Wild international conference on computer vision. pp. 593- 600 ,(2013) , 10.1109/ICCV.2013.79