Authors: J. Luettin, N.A. Thacker, S.W. Beet
DOI: 10.1109/ICSLP.1996.607030
Keywords:
Abstract: This paper describes a new approach for speaker identification based on lipreading. Visual features are extracted from image sequences of the talking face and consist of shape parameters, which describe the lip boundary, and intensity parameters, which describe the grey-level distribution of the mouth area. The intensity information is based on principal component analysis, using eigenspaces that deform with the shape model. The features account for both speech-dependent and speaker-dependent information. We built spatio-temporal models of these features using HMMs with mixtures of Gaussians. Promising results were obtained in text-independent tests performed on a small video database.
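The abstract outlines a pipeline of eigenspace (PCA) features of the mouth region modelled per speaker by HMMs with Gaussian-mixture emissions, with identification by maximum likelihood. The sketch below is not the authors' code; the libraries (numpy, scikit-learn, hmmlearn), function names, and all parameter values are illustrative assumptions.

```python
# Minimal sketch of the pipeline described in the abstract (assumed details):
# PCA "eigenspace" features of grey-level mouth patches, then one GMM-HMM
# per speaker, with identification by maximum log-likelihood.
import numpy as np
from sklearn.decomposition import PCA
from hmmlearn.hmm import GMMHMM


def fit_eigenspace(training_patches, n_components=20):
    """Fit a PCA eigenspace on flattened grey-level mouth patches.

    training_patches: array of shape (n_frames, n_pixels).
    """
    return PCA(n_components=n_components).fit(training_patches)


def extract_features(mouth_patches, pca):
    """Project each frame's mouth patch onto the eigenspace."""
    return pca.transform(mouth_patches)


def train_speaker_model(feature_sequences, n_states=5, n_mix=3):
    """Fit an HMM with Gaussian-mixture emissions on one speaker's sequences."""
    X = np.concatenate(feature_sequences)
    lengths = [len(seq) for seq in feature_sequences]
    model = GMMHMM(n_components=n_states, n_mix=n_mix,
                   covariance_type="diag", n_iter=50)
    model.fit(X, lengths)
    return model


def identify_speaker(models, feature_sequence):
    """Return the speaker whose model scores the test sequence highest."""
    scores = {spk: m.score(feature_sequence) for spk, m in models.items()}
    return max(scores, key=scores.get)
```

In use, one would fit the eigenspace on pooled training frames, train one model per enrolled speaker, and call identify_speaker on the features of an unseen utterance; the number of states, mixtures, and PCA components here are placeholders, not values from the paper.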