A syntactic approach to automatic lip feature extraction for speaker identification

作者: T. Wark , S. Sridharan

DOI: 10.1109/ICASSP.1998.679685

关键词:

摘要: This paper presents a novel technique for the tracking and extraction of features from lips purpose speaker identification. In noisy or other adverse conditions, identification performance via speech signal can significantly reduce, hence additional information which complement is particular interest. our system, syntactic derived chromatic in lip region. A model contour formed directly information, with no minimization procedure required to refine estimates. Colour are then extracted profiles taken around contour. Further improvement obtained linear discriminant analysis (LDA). Speaker models built based on Gaussian mixture (GMM). Identification experiments performed M2VTS database, encouraging results.

参考文章(7)
Tarcisio Coianiz, Lorenzo Torresani, Bruno Caprile, 2D Deformable Models for Visual Speech Analysis Springer, Berlin, Heidelberg. pp. 391- 398 ,(1996) , 10.1007/978-3-662-13015-5_29
M. U. Ramos Sánchez, J. Matas, J. Kittler, Statistical Chromaticity Models for Lip Tracking with B-splines AVBPA '97 Proceedings of the First International Conference on Audio- and Video-Based Biometric Person Authentication. pp. 69- 76 ,(1997) , 10.1007/BFB0015981
J. Luettin, N.A. Thacker, S.W. Beet, Locating and tracking facial speech features international conference on pattern recognition. ,vol. 1, pp. 652- 656 ,(1996) , 10.1109/ICPR.1996.546105
R. J. Mammone, Xiaoyu Zhang, R. P. Ramachandran, Robust speaker recognition: a feature-based approach IEEE Signal Processing Magazine. ,vol. 13, pp. 58- 71 ,(1996) , 10.1109/79.536825
Douglas A. Reynolds, Speaker identification and verification using Gaussian mixture speaker models Speech Communication. ,vol. 17, pp. 91- 108 ,(1995) , 10.1016/0167-6393(95)00009-D
R. Chellappa, C.L. Wilson, S. Sirohey, Human and machine recognition of faces: a survey Proceedings of the IEEE. ,vol. 83, pp. 705- 741 ,(1995) , 10.1109/5.381842
J. Luettin, N.A. Thacker, S.W. Beet, Learning to recognise talking faces international conference on pattern recognition. ,vol. 4, pp. 55- 59 ,(1996) , 10.1109/ICPR.1996.547233