Comparison of colour transforms used in lip segmentation algorithms

作者: Ashley D. Gritzman , David M. Rubin , Adam Pantanowitz

DOI: 10.1007/S11760-014-0615-X

关键词: RGB color modelHueSegmentationComputer visionCIELUVScale-space segmentationArtificial intelligenceYCbCrComputer scienceSegmentation-based object categorizationHSL and HSV

摘要: Lip segmentation is a fundamental system component in range of applications including: automatic lip reading, emotion recognition and biometric speaker identification. The first step involves applying colour transform to enhance the contrast between lips surrounding skin. However, there much debate among researchers as best for this task. As such, article presents most comprehensive study date by evaluating 33 transforms segmentation: 21 channels from seven space models (RGB, HSV, YCbCr, YIQ, CIEXYZ, CIELUV CIELAB) 12 additional (8 which are designed specifically segmentation). comparison extended determine segment oral cavity. Histogram intersection Otsu’s discriminant used quantify compare transforms. Results lip–skin validate experimental approach, 11 top literature. necessity selecting correct demonstrated an increase accuracy up three times. Hue-based including pseudo hue domain filtering perform segmentation, with HSV achieving greatest 93.85 %. a* CIELAB performs lip–oral cavity while LUX reasonably well both segmentation.

参考文章(33)
Thomas Dziurzyk, Ulrich Canzler, Extraction of Non Manual Features for Videobased Sign Language Recognition. Journal of Machine Vision and Applications. pp. 318- 321 ,(2002)
D.G. Stork, M.E. Hennecke, Speechreading: an overview of image processing, feature extraction, sensory integration and pattern recognition techniques international conference on automatic face and gesture recognition. ,(1996) , 10.1109/AFGR.1996.557235
Tarcisio Coianiz, Lorenzo Torresani, Bruno Caprile, 2D Deformable Models for Visual Speech Analysis Springer, Berlin, Heidelberg. pp. 391- 398 ,(1996) , 10.1007/978-3-662-13015-5_29
AM Martinez, R Benavente, The AR face database CVC Technical Report24. ,vol. 24, ,(1998)
N. Eveno, A. Caplier, P.-Y. Coulon, New color transformation for lips segmentation multimedia signal processing. pp. 3- 8 ,(2001) , 10.1109/MMSP.2001.962702
S.L. Wang, W.H. Lau, S.H. Leung, H. Yan, A real-time automatic lipreading system international symposium on circuits and systems. ,vol. 2, pp. 101- 104 ,(2004) , 10.1109/ISCAS.2004.1329218
Yu-Ichi Ohta, Takeo Kanade, Toshiyuki Sakai, Color information for region segmentation Computer Graphics and Image Processing. ,vol. 13, pp. 222- 241 ,(1980) , 10.1016/0146-664X(80)90047-7
Hamed Talea, Khashayar Yaghmaie, Automatic visual speech segmentation ieee international conference on communication software and networks. pp. 184- 188 ,(2011) , 10.1109/ICCSN.2011.6014877
Omer Demirkaya, Musa H. Asyali, Determination of image bimodality thresholds for different intensity distributions Signal Processing-image Communication. ,vol. 19, pp. 507- 516 ,(2004) , 10.1016/J.IMAGE.2004.04.002