Learning to recognise talking faces

作者： J. Luettin , N.A. Thacker , S.W. Beet

DOI: 10.1109/ICPR.1996.547233

关键词: Speech recognition 、 Face (geometry) 、 Facial recognition system 、 Artificial intelligence 、 Hidden Markov model 、 Identification (information) 、 Pattern recognition 、 Computer science 、 Parametric model 、 Visible Speech 、 Speech production 、 Sequence

摘要: An approach for person identification is described based on spatio-temporal analysis of the talking face. A represented by a parametric model visible speech articulators and their temporal characteristics during production. The consists shape parameters, representing lip contour intensity parameters grey level distribution in mouth region. used to track lips image sequences where are recovered from tracking results. While some these relate information, others intuitively related different persons we show that models features enable successful identification. We as mixtures Gaussians dependencies hidden Markov models. Identifying performed estimating likelihood each having generated observed sequence with highest chosen identified person.

uni-trier.de 本地加速

epfl.ch 本地加速

ieee.org 本地加速

ieeecomputersociety.org 本地加速

sci-hub.se PDF 下载加速

参考文章(23)

Kenji Mase, Recognition of Facial Expression from Optical Flow IEICE Transactions on Information and Systems. ,vol. 74, pp. 3474- 3483 ,(1991)

Juergen Luettin, Neil A. Thacker, Steve W. Beet, Active Shape Models for Visual Speech Feature Extraction Speechreading by Humans and Machines. ,vol. 150, pp. 383- 390 ,(1996) , 10.1007/978-3-662-13015-5_28

Lawrence Rabiner, Biing-Hwang Juang, Fundamentals of speech recognition ,(1993)

Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. PAMI-5, pp. 179- 190 ,(1983) , 10.1109/TPAMI.1983.4767370

Gillian Rhodes, Susan Brennan, Susan Carey, Identification and ratings of caricatures: implications for mental representations of faces. Cognitive Psychology. ,vol. 19, pp. 473- 497 ,(1987) , 10.1016/0010-0285(87)90016-8

R. Brunelli, D. Falavigna, T. Poggio, L. Stringa, Automatic person recognition by acoustic and geometric features machine vision applications. ,vol. 8, pp. 317- 325 ,(1995) , 10.1007/S001380050012

Dominique Valentin, Hervé Abdi, Alice J. O'Toole, Garrison W. Cottrell, Connectionist models of face processing: A survey Pattern Recognition. ,vol. 27, pp. 1209- 1230 ,(1994) , 10.1016/0031-3203(94)90006-X

J. Luettin, N.A. Thacker, S.W. Beet, Locating and tracking facial speech features international conference on pattern recognition. ,vol. 1, pp. 652- 656 ,(1996) , 10.1109/ICPR.1996.546105

R. Brunelli, D. Falavigna, Person identification using multiple cues IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 17, pp. 955- 966 ,(1995) , 10.1109/34.464560

10.

Ashok Samal, Prasana A. Iyengar, Automatic recognition and analysis of human faces and facial expressions: a survey Pattern Recognition. ,vol. 25, pp. 65- 77 ,(1992) , 10.1016/0031-3203(92)90007-6

Learning to recognise talking faces

来源期刊

我的账户

Learning to recognise talking faces

来源期刊

相似文章 10

我的账户