Acoustic and facial features for speaker recognition

Authors: M.J. Roach, J.D. Brand, J.S.D. Mason

DOI: 10.1109/ICPR.2000.903534

Abstract: This paper gives an insight into biometrics used for speaker recognition. Three different biometrics are presented, based on acoustic, geometric lip, and holistic facial features. Experiments are carried out using a corpus of the DAVID audio-visual database. Recognition accuracy is found to be similar in the two domains. The visual features rely on a signature-coding method applied to the contour of the lips and on a mean dynamic signature that captures the motion of the face during a spoken utterance. Physical biometrics (static measurements) demand only small model sizes, perhaps just a single template, and therefore require less training data. Conversely, behavioral biometrics contain more variation.

References (8)
Ted H. Applebaum, Brian A. Hanson, Tradeoffs in the design of regression features for word recognition. Conference of the International Speech Communication Association. (1991)
Ping S. Huang, Chris J. Harris, Mark S. Nixon, Visual Surveillance and Tracking of Humans by Face and Gait Recognition. IFAC Proceedings Volumes. vol. 31, pp. 113-118, (1998), 10.1016/S1474-6670(17)38931-0
Juergen Luettin, Towards Speaker Independent Continuous Speechreading. Conference of the International Speech Communication Association. pp. 1991-1994, (1997)
Lionel Revéret, Christian Benoît, A Viseme-based Approach to Labiometrics for Automatic Lipreading. AVBPA '97 Proceedings of the First International Conference on Audio- and Video-Based Biometric Person Authentication. pp. 335-342, (1997), 10.1007/BFB0016013
John S. Mason, Hywel B. Richards, John S. Bridle, Melvyn J. Hunt, Deriving articulatory representations of speech. Conference of the International Speech Communication Association. (1995)
J.S.D. Mason, J. Brand, R. Auckenthaler, F. Deravi, C. Chibelushi, Lip signatures for automatic person recognition. Multimedia Signal Processing. pp. 457-462, (1999), 10.1109/MMSP.1999.793890
CC Chibelushi, S Gandon, JSD Mason, F Deravi, RD Johnston, Design issues for a digital audio-visual integrated database. IEE Colloquium on Integrated Audio-Visual Processing for Recognition, Synthesis and Communication. pp. 7-7, (1996), 10.1049/IC:19961151