Robust speech recognition with speech enhanced deep neural networks.

作者： Tian Gao , Li-Rong Dai , Chin-Hui Lee , Qing Wang , Jun Du

DOI:

关键词: Speech recognition 、 Deep neural networks 、 Computer science 、 Time delay neural network

摘要:

参考文章(19)

Alex Acero, Li Deng, Jasha Droppo, Evaluation of the SPLICE algorithm on the Aurora2 database. conference of the international speech communication association. pp. 217- 220 ,(2001)

William Hartmann, Arun Narayanan, Eric Fosler-Lussier, DeLiang Wang, A Direct Masking Approach to Robust ASR IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 21, pp. 1993- 2005 ,(2013) , 10.1109/TASL.2013.2263802

Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang, HMM-based pseudo-clean speech synthesis for splice algorithm 2010 IEEE International Conference on Acoustics, Speech and Signal Processing. pp. 4570- 4573 ,(2010) , 10.1109/ICASSP.2010.5495569

Abdel-rahman Mohamed, George E. Dahl, Geoffrey Hinton, Acoustic Modeling Using Deep Belief Networks IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 20, pp. 14- 22 ,(2012) , 10.1109/TASL.2011.2109382

M. Afify, X. Cui, Y. Gao, Stereo-Based Stochastic Mapping for Robust Speech Recognition IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 17, pp. 1325- 1334 ,(2009) , 10.1109/TASL.2009.2018017

Arun Narayanan, DeLiang Wang, Investigation of speech separation as a front-end for noise robust speech recognition IEEE Transactions on Audio, Speech, and Language Processing. ,vol. 22, pp. 826- 835 ,(2014) , 10.1109/TASLP.2014.2305833

Michael L. Seltzer, Dong Yu, Yongqiang Wang, An investigation of deep neural networks for noise robust speech recognition international conference on acoustics, speech, and signal processing. pp. 7398- 7402 ,(2013) , 10.1109/ICASSP.2013.6639100

Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee, An Experimental Study on Speech Enhancement Based on Deep Neural Networks IEEE Signal Processing Letters. ,vol. 21, pp. 65- 68 ,(2014) , 10.1109/LSP.2013.2291240

Yifan Gong, Speech recognition in noisy environments: a survey Speech Communication. ,vol. 16, pp. 261- 291 ,(1995) , 10.1016/0167-6393(94)00059-J

Jun Du, Qiang Huo, Synthesized stereo-based stochastic mapping with data selection for robust speech recognition international symposium on chinese spoken language processing. pp. 122- 125 ,(2012) , 10.1109/ISCSLP.2012.6423542