Nonparametric Feature Matching Based Conditional Random Fields for Gesture Recognition from Multi-Modal Video

Author: Ju Yong Chang

DOI: 10.1109/TPAMI.2016.2519021

Abstract: We present a new gesture recognition method that is based on the conditional random field (CRF) model using multiple feature matching. Our approach solves the labeling problem, determining gesture categories and their temporal ranges at the same time. A generative probabilistic model is formalized, and probability densities are nonparametrically estimated by matching input features with a training dataset. In addition to the conventional skeletal joint-based features, the appearance information near the active hand in an RGB image is exploited to capture the detailed motion of fingers. The estimated likelihood function is then used as the unary term for our CRF model. A smoothness term is also incorporated to enforce the temporal coherence of the solution. Frame-wise recognition results can then be obtained by applying an efficient dynamic programming technique. To estimate the parameters of the proposed CRF model, we incorporate the structured support vector machine (SSVM) framework, which can perform efficient structured learning using large-scale datasets. Experimental results demonstrate that our method provides effective gesture recognition results for challenging real gesture datasets. By scoring 0.8563 in the mean Jaccard index, our method has obtained state-of-the-art results for the gesture recognition track of the 2014 ChaLearn Looking at People (LAP) Challenge.
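
The frame-wise labeling step described in the abstract can be illustrated with a minimal sketch of dynamic programming on a chain-structured model. This is not the paper's implementation: the function name `decode_labels`, the Potts-style smoothness penalty (a constant cost whenever adjacent frames take different labels), and the toy cost values are assumptions made here for illustration. The unary costs stand in for negative log-likelihoods, which the paper estimates by nonparametric feature matching.

```python
import numpy as np


def decode_labels(unary, smoothness_weight):
    """Return the label sequence with minimum total cost on a chain.

    unary: (T, K) array of per-frame label costs (assumed here to be
        negative log-likelihoods; lower is better).
    smoothness_weight: assumed Potts penalty paid whenever two
        consecutive frames take different labels.
    """
    unary = np.asarray(unary, dtype=float)
    num_frames, num_labels = unary.shape

    # Pairwise cost: 0 if the label is unchanged, smoothness_weight otherwise.
    pairwise = smoothness_weight * (1.0 - np.eye(num_labels))

    cost = unary[0].copy()
    backptr = np.zeros((num_frames, num_labels), dtype=int)

    # Forward pass: best cost of any labeling ending in each label at frame t.
    for t in range(1, num_frames):
        total = cost[:, None] + pairwise  # total[i, j]: previous label i, current j
        backptr[t] = np.argmin(total, axis=0)
        cost = total[backptr[t], np.arange(num_labels)] + unary[t]

    # Backward pass: follow the stored pointers from the best final label.
    labels = np.empty(num_frames, dtype=int)
    labels[-1] = int(np.argmin(cost))
    for t in range(num_frames - 1, 0, -1):
        labels[t - 1] = backptr[t, labels[t]]
    return labels


# Toy example: frame 2 weakly prefers label 1, all other frames prefer label 0.
toy_unary = [[0.0, 1.0], [0.0, 1.0], [0.6, 0.4], [0.0, 1.0], [0.0, 1.0]]
unsmoothed = decode_labels(toy_unary, smoothness_weight=0.0)
smoothed = decode_labels(toy_unary, smoothness_weight=1.0)
```

With no smoothness penalty the isolated preference at frame 2 survives, while a penalty of 1.0 makes the two label switches more expensive than the unary gain, so the decoded sequence stays on label 0 throughout. The forward pass costs O(TK^2) for T frames and K labels.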
