LIP CONTOUR DETECTION TECHNIQUES BASED ON FRONT VIEW OF FACE

作者: Samir K. Bandyopadhyay

DOI:

关键词: Lip trackingFrame (networking)Computer scienceContour analysisProcess (computing)Tracking (particle physics)Computer visionPosition (vector)Face (geometry)Speech readingArtificial intelligence

摘要: Lip contour detection and tracking is the most important pre-requisite for computerized speech reading. Several approaches have been proposed lip after accurately initialized on first frame. Detection of an issue in A relatively large class reading algorithms are available based analysis. In these cases, extraction needed as step. By extraction, we usually refer to process frame audio-visual image sequence. Obtaining subsequent frames referred tracking. While there well developed techniques perform this task automatically, case things different. This a much more difficult than tracking, due lack good a-priori information respect mouth position image, size, approximate shape mouth, opening etc. paper propose solution automatic if front view face available. The method has tested database containing images different people was found maximum success rate 85%.

参考文章(11)
David M. W. Powers, Trent W. Lewis, Audio-Visual Speech Recognition using Red Exclusion and Neural Networks. Journal of Research and Practice in Information Technology. ,vol. 35, pp. 41- 64 ,(2003)
Robert Kaucic, Barney Dalton, Andrew Blake, Real-Time Lip Tracking for Audio-Visual Speech Recognition Applications european conference on computer vision. pp. 376- 387 ,(1996) , 10.1007/3-540-61123-1_154
Mohammad Sadeghi, Josef Kittler, Kieron Messer, Real Time Segmentation of Lip Pixels for Lip Tracker Initialization computer analysis of images and patterns. pp. 317- 324 ,(2001) , 10.1007/3-540-44692-3_39
Ara V. Nefian, Luhong Liang, Xiaobo Pi, Xiaoxing Liu, Kevin Murphy, Dynamic Bayesian Networks for Audio-Visual Speech Recognition EURASIP Journal on Advances in Signal Processing. ,vol. 2002, pp. 1274- 1288 ,(2002) , 10.1155/S1110865702206083
P. Kuo, Improved lip fitting and tracking for model-based multimedia and coding IEE International Conference on Visual Information Engineering (VIE 2005). pp. 251- 258 ,(2005) , 10.1049/CP:20050097
G. Pomianos, C. Neti, G. Gravier, A. Garg, A.W. Senior, Recent advances in the automatic recognition of audiovisual speech Proceedings of the IEEE. ,vol. 91, pp. 1306- 1326 ,(2003) , 10.1109/JPROC.2003.817150
R. Stiefelhagen, Jie Yang, A. Waibel, A model-based gaze tracking system Proceedings IEEE International Joint Symposia on Intelligence and Systems. pp. 304- 310 ,(1996) , 10.1109/IJSIS.1996.565083
Xiaozheng Zhang, Charles C. Broun, Russell M. Mersereau, Mark A. Clements, Automatic Speechreading with Applications to Human-Computer Interfaces EURASIP Journal on Advances in Signal Processing. ,vol. 2002, pp. 1228- 1247 ,(2002) , 10.1155/S1110865702206137
N. Eveno, A. Caplier, P.-Y. Coulon, Accurate and quasi-automatic lip tracking IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 14, pp. 706- 715 ,(2004) , 10.1109/TCSVT.2004.826754
C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, D. Vergyri, Large-vocabulary audio-visual speech recognition: a summary of the Johns Hopkins Summer 2000 Workshop multimedia signal processing. pp. 619- 624 ,(2001) , 10.1109/MMSP.2001.962801