Automatic combined lip segmentation in color images

Authors: Hamed Talea, Khashayar Yaghmaie

DOI: 10.1109/ICCSN.2011.6014860

Keywords:

Abstract: Automatic speech recognition (ASR) performs well under restricted conditions. Lipreading is a main part of audio-visual speech recognition systems, and an accurate algorithm for lip detection and lip motion tracking helps to improve the recognition rate. This paper proposes a new combined lip segmentation method that incorporates the red exclusion algorithm. The accuracy of the proposed method is verified by applying it to several images.

References (16)
Eric David Petajan, Automatic lipreading to enhance speech recognition (speech reading). University of Illinois at Urbana-Champaign (1984)
J. Andrew Bangham, Richard Harvey, Iain Matthews, Stephen Cox, Nonlinear scale decomposition based features for visual speech recognition. European Signal Processing Conference, pp. 1-4 (1998), 10.5281/ZENODO.36896
David M. W. Powers, Trent W. Lewis, Audio-Visual Speech Recognition using Red Exclusion and Neural Networks. Journal of Research and Practice in Information Technology, vol. 35, pp. 41-64 (2003)
A. Adjoudani, C. Benoît, On the Integration of Auditory and Visual Parameters in an HMM-based ASR. Springer, Berlin, Heidelberg, pp. 461-471 (1996), 10.1007/978-3-662-13015-5_35
Lionel Revéret, Christian Benoît, A Viseme-based Approach to Labiometrics for Automatic Lipreading. AVBPA '97 Proceedings of the First International Conference on Audio- and Video-Based Biometric Person Authentication, pp. 335-342 (1997), 10.1007/BFB0016013
Shin-ya Takahashi, Tsuyoshi Morimoto, Sakashi Maeda, Naoyuki Tsuruta, Dialogue Experiment for Elderly People in Home Health Care System. Text, Speech and Dialogue, pp. 418-423 (2003), 10.1007/978-3-540-39398-6_60
Tsuhan Chen, R.R. Rao, Audio-visual integration in multimodal communication. Proceedings of the IEEE, vol. 86, pp. 837-852 (1998), 10.1109/5.664274
Kathleen E. Finn, Allen A. Montgomery, Automatic optically-based recognition of speech. Pattern Recognition Letters, vol. 8, pp. 159-164 (1988), 10.1016/0167-8655(88)90094-3
Juergen Luettin, Neil A. Thacker, Speechreading using Probabilistic Models. Computer Vision and Image Understanding, vol. 65, pp. 163-178 (1997), 10.1006/CVIU.1996.0570
Shigeo Morishima, Shin Ogata, Kazumasa Murai, Satoshi Nakamura, Audio-visual speech translation with automatic lip synchronization and face tracking based on 3-D head model. IEEE International Conference on Acoustics Speech and Signal Processing, vol. 2, pp. 2117-2120 (2002), 10.1109/ICASSP.2002.5745053