Learning to recognise talking faces

Authors: J. Luettin, N.A. Thacker, S.W. Beet

DOI: 10.1109/ICPR.1996.547233

Keywords: Speech recognition, Face (geometry), Facial recognition system, Artificial intelligence, Hidden Markov model, Identification (information), Pattern recognition, Computer science, Parametric model, Visible speech, Speech production, Sequence

Abstract: An approach for person identification is described based on spatio-temporal analysis of the talking face. A person is represented by a parametric model of the visible speech articulators and their temporal characteristics during speech production. The model consists of shape parameters, representing the lip contour, and intensity parameters, representing the grey level distribution in the mouth region. The model is used to track the lips in image sequences, where the model parameters are recovered from the tracking results. While some of these parameters relate to speech information, others are intuitively related to different persons, and we show that models based on these features enable successful person identification. We model the parameters as mixtures of Gaussians and their temporal dependencies by hidden Markov models. Identifying persons is performed by estimating the likelihood of each model having generated the observed sequence of features; the model with the highest likelihood is chosen as the identified person.
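The identification rule in the abstract (score the observed feature sequence under each person's hidden Markov model and pick the highest likelihood) can be illustrated with a minimal sketch. This is not the authors' implementation: the scalar features, the two-state toy models, the names `person_a` and `person_b`, and all parameter values are hypothetical, chosen only to show the forward-algorithm scoring with Gaussian-mixture emissions.

```python
import math


def safe_log(p):
    """Log that maps zero probability to -inf instead of raising."""
    return math.log(p) if p > 0 else -math.inf


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of log values."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def log_gaussian(x, mean, var):
    """Log density of a 1-D Gaussian (scalar features are an assumption)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def log_emission(x, mixture):
    """Log density of a Gaussian mixture given as (weight, mean, variance)."""
    return log_sum_exp(
        [safe_log(w) + log_gaussian(x, mu, var) for w, mu, var in mixture]
    )


def log_likelihood(obs, model):
    """Forward algorithm in log space: log P(obs | model)."""
    start, trans, mixtures = model["start"], model["trans"], model["mixtures"]
    n = len(start)
    alpha = [safe_log(start[i]) + log_emission(obs[0], mixtures[i])
             for i in range(n)]
    for x in obs[1:]:
        alpha = [
            log_sum_exp([alpha[i] + safe_log(trans[i][j]) for i in range(n)])
            + log_emission(x, mixtures[j])
            for j in range(n)
        ]
    return log_sum_exp(alpha)


def identify(obs, models):
    """Return the best-scoring model name and all per-model scores."""
    scores = {name: log_likelihood(obs, m) for name, m in models.items()}
    return max(scores, key=scores.get), scores


# Hypothetical toy models, one per enrolled person.
models = {
    "person_a": {
        "start": [0.6, 0.4],
        "trans": [[0.7, 0.3], [0.4, 0.6]],
        "mixtures": [[(1.0, 0.0, 1.0)],
                     [(0.5, 2.0, 1.0), (0.5, 3.0, 1.0)]],
    },
    "person_b": {
        "start": [0.5, 0.5],
        "trans": [[0.8, 0.2], [0.2, 0.8]],
        "mixtures": [[(1.0, -3.0, 1.0)],
                     [(1.0, -5.0, 1.0)]],
    },
}

observed = [0.1, 2.2, 2.9, 0.3]  # hypothetical feature sequence
best, scores = identify(observed, models)
```

In the paper the observations would be vectors of shape and intensity parameters recovered by lip tracking; the scalar case here keeps the sketch short, and the per-person models would be trained rather than written by hand.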

References (23)
Kenji Mase, Recognition of Facial Expression from Optical Flow. IEICE Transactions on Information and Systems, vol. 74, pp. 3474-3483 (1991)
Juergen Luettin, Neil A. Thacker, Steve W. Beet, Active Shape Models for Visual Speech Feature Extraction. Speechreading by Humans and Machines, vol. 150, pp. 383-390 (1996), DOI: 10.1007/978-3-662-13015-5_28
Lawrence Rabiner, Biing-Hwang Juang, Fundamentals of Speech Recognition (1993)
Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5, pp. 179-190 (1983), DOI: 10.1109/TPAMI.1983.4767370
Gillian Rhodes, Susan Brennan, Susan Carey, Identification and ratings of caricatures: implications for mental representations of faces. Cognitive Psychology, vol. 19, pp. 473-497 (1987), DOI: 10.1016/0010-0285(87)90016-8
R. Brunelli, D. Falavigna, T. Poggio, L. Stringa, Automatic person recognition by acoustic and geometric features. Machine Vision and Applications, vol. 8, pp. 317-325 (1995), DOI: 10.1007/S001380050012
Dominique Valentin, Hervé Abdi, Alice J. O'Toole, Garrison W. Cottrell, Connectionist models of face processing: A survey. Pattern Recognition, vol. 27, pp. 1209-1230 (1994), DOI: 10.1016/0031-3203(94)90006-X
J. Luettin, N.A. Thacker, S.W. Beet, Locating and tracking facial speech features. International Conference on Pattern Recognition, vol. 1, pp. 652-656 (1996), DOI: 10.1109/ICPR.1996.546105
R. Brunelli, D. Falavigna, Person identification using multiple cues. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 17, pp. 955-966 (1995), DOI: 10.1109/34.464560
Ashok Samal, Prasana A. Iyengar, Automatic recognition and analysis of human faces and facial expressions: a survey. Pattern Recognition, vol. 25, pp. 65-77 (1992), DOI: 10.1016/0031-3203(92)90007-6