A syntactic approach to automatic lip feature extraction for speaker identification

Authors: T. Wark, S. Sridharan

DOI: 10.1109/ICASSP.1998.679685

Abstract: This paper presents a novel technique for the tracking and extraction of features from lips for the purpose of speaker identification. In noisy or other adverse conditions, identification performance via the speech signal can significantly reduce, hence additional information which can complement the speech signal is of particular interest. In our system, syntactic information is derived from chromatic information in the lip region. A model of the lip contour is formed directly from the syntactic information, with no minimization procedure required to refine estimates. Colour features are then extracted from the lips via profiles taken around the lip contour. Further improvement in lip features is obtained via linear discriminant analysis (LDA). Speaker models are built from the lip features based on the Gaussian mixture model (GMM). Identification experiments are performed on the M2VTS database, with encouraging results.
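
The abstract's final stage, identifying a speaker by scoring features against per-speaker Gaussian mixture models, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes diagonal-covariance mixtures, and the model parameters, feature dimension, and the names `speaker_models`, `identify`, and `test_frames` are hypothetical values chosen only for illustration. Training (for example via EM) and the LDA projection are omitted.

```python
import math

def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def gmm_log_likelihood(frames, gmm):
    """Sum over frames of log(sum_k w_k * N(x | mean_k, var_k))."""
    total = 0.0
    for x in frames:
        # Log-sum-exp over mixture components for numerical stability.
        terms = [
            math.log(w) + log_gaussian_diag(x, mean, var)
            for w, mean, var in gmm
        ]
        peak = max(terms)
        total += peak + math.log(sum(math.exp(t - peak) for t in terms))
    return total

def identify(frames, models):
    """Return the speaker whose GMM gives the highest log-likelihood."""
    return max(models, key=lambda name: gmm_log_likelihood(frames, models[name]))

# Hypothetical two-component models over 2-D features (made-up numbers).
# Each component is (weight, mean vector, per-dimension variance).
speaker_models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "speaker_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])],
}

# Hypothetical feature frames extracted from one test sequence.
test_frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]
print(identify(test_frames, speaker_models))
```

Summing per-frame log-likelihoods treats the frames as independent, which is the usual simplifying assumption for GMM-based identification; the decision rule is simply the speaker model with the highest total score.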
