A parametric model for realistic lip segmentation

作者: N. Eveno , A. Caplier , P.-Y. Coulon

DOI: 10.1109/ICARCV.2002.1234982

关键词:

摘要: Lip segmentation is an essential stage in many multimedia systems such as videoconferencing, lip reading, or low bit rate coding communication systems. In this paper, we propose accurate and robust algorithm. First, the mouth region several characteristic points are detected by using "hybrid edges" (which combine colour intensity information) a priori knowledge about morphology. Corners position, which crucial, provided coarse-to-fine process. Then, parametric model fitted on lips. We consider that boundary composed of independent cubic polynomial curves. It gives complexity global flexible enough to reproduce specificity very different shapes. Compared existing models, it brings significant accuracy realism improvement. Moreover, ensures convergence towards edges because parts independent.

参考文章(9)
Tarcisio Coianiz, Lorenzo Torresani, Bruno Caprile, 2D Deformable Models for Visual Speech Analysis Springer, Berlin, Heidelberg. pp. 391- 398 ,(1996) , 10.1007/978-3-662-13015-5_29
Juergen Luettin, Neil A. Thacker, Steve W. Beet, Active Shape Models for Visual Speech Feature Extraction Speechreading by Humans and Machines. ,vol. 150, pp. 383- 390 ,(1996) , 10.1007/978-3-662-13015-5_28
A. Hurlbert, T. Poggio, Synthesizing a color algorithm from examples Science. ,vol. 239, pp. 482- 485 ,(1988) , 10.1126/SCIENCE.3340834
D. Terzopoulos, K. Waters, Analysis and synthesis of facial image sequences using physical and anatomical models IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 15, pp. 569- 579 ,(1993) , 10.1109/34.216726
P. Delmas, P.Y. Coulon, V. Fristot, Automatic snakes for robust lip boundaries extraction international conference on acoustics speech and signal processing. ,vol. 6, pp. 3069- 3072 ,(1999) , 10.1109/ICASSP.1999.757489
Michael Kass, Andrew Witkin, Demetri Terzopoulos, Snakes : Active Contour Models International Journal of Computer Vision. ,vol. 1, pp. 321- 331 ,(1988) , 10.1007/BF00133570
X. Zhang, R.M. Mersereau, Lip feature extraction towards an automatic speechreading system international conference on image processing. ,vol. 3, pp. 226- 229 ,(2000) , 10.1109/ICIP.2000.899336
Alan L. Yuille, Peter W. Hallinan, David S. Cohen, Feature extraction from faces using deformable templates International Journal of Computer Vision. ,vol. 8, pp. 99- 111 ,(1992) , 10.1007/BF00127169
K. Venkatesh Hennecke, Marcus, Prasad, David Stork, Using deformable templates to infer visual speech dynamics asilomar conference on signals, systems and computers. ,vol. 1, pp. 578- 582 ,(1994) , 10.1109/ACSSC.1994.471518