Lip event detection using oriented histograms of regional optical flow and low rank affinity pursuit

作者: Xin Liu , Yiu-ming Cheung , Yuan Yan Tang

DOI: 10.1016/J.CVIU.2015.11.015

关键词:

摘要: An efficient oriented histograms of regional optical flow (OH-ROF) is presented to discriminatively code the visual appearance each lip motion frame.Each clip represented by a sequence OH-ROF vectors as its signature.We introduce stabilization scheme reduce impact irrelevant motions.We address an approach detecting silence event via small magnitude.We propose low rank affinity pursuit method determine lip-dynamic states mouth opening and closing. Lip detection crucial importance better understanding speech perceptually between humans computers. In this paper, we using pursuit. First, align extracted region sequences caused moving cameras. Then, field calculated from these sequentially stabilized images descriptor, namely OH-ROF, frame, whereby can be signature. Subsequently, detect based on magnitude, further that incorporates As result, various kind events appropriately estimated. The proposed neither requires any training set labeled videos nor learns priors in unconstrained video. Experiments show promising result comparison with state-of-the-art counterparts.

参考文章(37)
F. Luthon, M. Lievin, Lip motion automatic detection scandinavian conference on image analysis. pp. 253- 260 ,(1997)
Conrad Sanderson, Brian C. Lovell, Multi-Region Probabilistic Histograms for Robust and Scalable Identity Inference international conference on biometrics. ,vol. 5558, pp. 199- 208 ,(2009) , 10.1007/978-3-642-01793-3_21
J. Wright, Wenli Xu, Yi Ma, Yigang Peng, A. Ganesh, RASL: Robust Alignment by Sparse and Low-Rank Decomposition for Linearly Correlated Images IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 34, pp. 2233- 2246 ,(2012) , 10.1109/TPAMI.2011.282
Rajitha Navarathna, David Dean, Sridha Sridharan, Clinton Fookes, Patrick Lucey, Visual Voice Activity Detection Using Frontal versus Profile Views digital image computing: techniques and applications. pp. 134- 139 ,(2011) , 10.1109/DICTA.2011.29
Hamed Talea, Khashayar Yaghmaie, Automatic visual speech segmentation ieee international conference on communication software and networks. pp. 184- 188 ,(2011) , 10.1109/ICCSN.2011.6014877
A.J. Aubrey, Y.A. Hicks, J.A. Chambers, Visual voice activity detection with optical flow Iet Image Processing. ,vol. 4, pp. 463- 472 ,(2010) , 10.1049/IET-IPR.2009.0042
Deqing Sun, Stefan Roth, Michael J. Black, Secrets of optical flow estimation and their principles computer vision and pattern recognition. pp. 2432- 2439 ,(2010) , 10.1109/CVPR.2010.5539939
Simon Baker, Iain Matthews, Lucas-Kanade 20 Years On: A Unifying Framework International Journal of Computer Vision. ,vol. 56, pp. 221- 255 ,(2004) , 10.1023/B:VISI.0000011205.11775.FD
Xin Liu, Yiu-ming Cheung, Shu-Juan Peng, Zhen Cui, Bineng Zhong, Ji-Xiang Du, None, Automatic motion capture data denoising via filtered subspace clustering and low rank matrix approximation Signal Processing. ,vol. 105, pp. 350- 362 ,(2014) , 10.1016/J.SIGPRO.2014.06.009