Model-based compensation of the additive noise for continuous speech recognition. Experiments using the AURORA II database and tasks

作者: A. M. Peinado , J. C. Segura , M. C. Benitez

DOI:

关键词:

摘要: In this paper we apply a model-based compensation method to cancel the effect of additive noise in Automatic Speech Recognition systems. The is formulated statistical framework order perform optimal given observed noisy speech, model describing statistics speech recorded clean reference environment and estimation recognition environment. estimated using first frames sentence be recognized frame-by-frame algorithm performed, so that procedure does not constrain real-time systems compatible with emerging technologies based on distributed recognition. We have performed experiments under conditions AURORA II database for tasks developed as standard reference. Experiments been carried out including both, multicondition training approaches. experimental results show improvements performance when proposed applied.

参考文章(7)
Brian S. Eberman, Pedro J. Moreno, A new algorithm for robust speech recognition: the delta vector taylor series approach. conference of the international speech communication association. ,(1997)
M. J. F. Gales, Steve J. Young, HMM recognition in noise using parallel model combination. conference of the international speech communication association. ,(1993)
Jerome R. Bellegarda, Statistical techniques for robust ASR: review and perspectives. conference of the international speech communication association. ,(1997)
Richard M. Stern, Bhiksha Raj, Pedro J. Moreno, COMPENSATION FOR ENVIRONMENTAL DEGRADATION IN AUTOMATIC SPEECH RECOGNITION ,(1999)
Pedro J. Moreno, Speech recognition in noisy environments Carnegie Mellon University. ,(1996)
C.R. Jankowski, H.-D.H. Vo, R.P. Lippmann, A comparison of signal processing front ends for automatic word recognition IEEE Transactions on Speech and Audio Processing. ,vol. 3, pp. 286- 293 ,(1995) , 10.1109/89.397093
Yifan Gong, Speech recognition in noisy environments: a survey Speech Communication. ,vol. 16, pp. 261- 291 ,(1995) , 10.1016/0167-6393(94)00059-J