Investigating back propagation neural network for lip reading

作者: Neeru Rathee

DOI: 10.1109/CCAA.2016.7813748

关键词:

摘要: Visual Speech processing is the key concern of researchers working in field speech and computer vision. Though earlier audio was popularly used for recognition but their performance deteriorated presence noise. Moreover, variation accent another challenge that affects such systems. In presented paper, we explore lip texture features visemes. The variations temporal behavior coded using Local Binary Pattern three orthogonal planes. classification carried out back propagation neural network, which a network with hidden layer. added advantage layer reading it takes into account nonlinear while speaking. proposed approach Hindi word achieved high accuracy at cost computation time.

参考文章(16)
Georgios Tzimiropoulos, Joan Alabort-i-Medina, Stefanos Zafeiriou, Maja Pantic, Generic active appearance models revisited asian conference on computer vision. ,vol. 7726, pp. 650- 663 ,(2012) , 10.1007/978-3-642-37431-9_50
Jie Yan, None, Ensemble SVM Regression Based Multi-View Face Detection System international workshop on machine learning for signal processing. pp. 163- 169 ,(2007) , 10.1109/MLSP.2007.4414300
Say Wei Foo, Eng Guan Lim, Speaker recognition using adaptively boosted classifier ieee region 10 conference. ,vol. 1, pp. 442- 446 ,(2001) , 10.1109/TENCON.2001.949632
P. Viola, M. Jones, Rapid object detection using a boosted cascade of simple features computer vision and pattern recognition. ,vol. 1, pp. 511- 518 ,(2001) , 10.1109/CVPR.2001.990517
Xiaohua Huang, Guoying Zhao, Wenming Zheng, Matti Pietikainen, Spatiotemporal Local Monogenic Binary Patterns for Facial Expression Recognition IEEE Signal Processing Letters. ,vol. 19, pp. 243- 246 ,(2012) , 10.1109/LSP.2012.2188890
T.F. Cootes, C.J. Taylor, D.H. Cooper, J. Graham, Active shape models—their training and application Computer Vision and Image Understanding. ,vol. 61, pp. 38- 59 ,(1995) , 10.1006/CVIU.1995.1004
Timo Ojala, Matti Pietikäinen, David Harwood, A comparative study of texture measures with classification based on featured distributions Pattern Recognition. ,vol. 29, pp. 51- 59 ,(1996) , 10.1016/0031-3203(95)00067-4
Y LAY, C TSAI, H YANG, C LIN, C LAI, The application of extension neuro-network on computer-assisted lip-reading recognition for hearing impaired Expert Systems With Applications. ,vol. 34, pp. 1465- 1473 ,(2008) , 10.1016/J.ESWA.2007.01.042
Xin Liu, Yiu-ming Cheung, Learning Multi-Boosted HMMs for Lip-Password Based Speaker Verification IEEE Transactions on Information Forensics and Security. ,vol. 9, pp. 233- 246 ,(2014) , 10.1109/TIFS.2013.2293025
Di Huang, Caifeng Shan, Mohsen Ardabilian, Yunhong Wang, Liming Chen, Local Binary Patterns and Its Application to Facial Image Analysis: A Survey systems man and cybernetics. ,vol. 41, pp. 765- 781 ,(2011) , 10.1109/TSMCC.2011.2118750