Acoustic and facial features for speaker recognition

作者： M.J. Roach , J.D. Brand , J.S.D. Mason

DOI: 10.1109/ICPR.2000.903534

关键词:

摘要: This paper gives an insight into biometrics used for speaker recognition. Three different are presented, based on: acoustic, geometric lip, and holistic facial features. Experiments carried out using a corpus of the DAVID audio-visual database. Recognition accuracy is found to be similar in 2 domains. The visual feature on method signature coding contour lips mean dynamic signature, capturing motions face during spoken utterance. Physical (static measurements) demand only small model sizes, perhaps just single template, therefore require less training data. Conversely behavioral contain more variation

uni-trier.de 本地加速

swan.ac.uk 本地加速

doi.org 本地加速

uni-trier.de PDF 下载加速

sci-hub.se PDF 下载加速

参考文章(8)

Ted H. Applebaum, Brian A. Hanson, Tradeoffs in the design of regression features for word recognition. conference of the international speech communication association. ,(1991)

Ping S. Huang, Chris J. Harris, Mark S. Nixon, Visual Surveillance and Tracking of Humans by Face and Gait Recognition IFAC Proceedings Volumes. ,vol. 31, pp. 113- 118 ,(1998) , 10.1016/S1474-6670(17)38931-0

Juergen Luettin, Towards Speaker Independent Continuous Speechreading conference of the international speech communication association. pp. 1991- 1994 ,(1997)

Lionel Revéret, Christian Benoît, A Viseme-based Approach to Labiometrics for Automatic Lipreading AVBPA '97 Proceedings of the First International Conference on Audio- and Video-Based Biometric Person Authentication. pp. 335- 342 ,(1997) , 10.1007/BFB0016013

John S. Mason, Hywel B. Richards, John S. Bridle, Melvyn J. Hunt, Deriving articulatory representations of speech. conference of the international speech communication association. ,(1995)

Sadaoki Furui, Research on individuality features in speech waves and automatic speaker recognition techniques Speech Communication. ,vol. 5, pp. 183- 197 ,(1986) , 10.1016/0167-6393(86)90007-5

J.S.D. Mason, J. Brand, R. Auckenthaler, F. Deravi, C. Chibelushi, Lip signatures for automatic person recognition multimedia signal processing. pp. 457- 462 ,(1999) , 10.1109/MMSP.1999.793890

CC Chibelushi, S Gandon, JSD Mason, F Deravi, RD Johnston, Design issues for a digital audio-visual integrated database IEE Colloquium on Integrated Audio-Visual Processing for Recognition, Synthesis and Communication. pp. 7- 7 ,(1996) , 10.1049/IC:19961151

Acoustic and facial features for speaker recognition

来源期刊

我的账户

Acoustic and facial features for speaker recognition

来源期刊

相似文章 4

Synchronous HMMs for audio-visual speech processing

A survey of face detection, extraction and recognition

3D Facial Gestures in Biometrics: from Feasibility Study to Application

Assessing the Uniqueness and Permanence of Facial Actions for Use in Biometric Applications

我的账户