Multi-stream ASR trained with heterogeneous reverberant environments

Author: M.L. Shire

DOI: 10.1109/ICASSP.2001.940815

Keywords: Computer science, Reverberation, Feature (machine learning), Pattern recognition, Multi-stream, Artificial intelligence, Acoustic model, Speech recognition, Hidden Markov model

Abstract: A common problem with automatic speech recognition (ASR) systems is that performance degrades when the system is presented with speech from a different acoustic environment than the one used during training. An important cause is that the feature distribution on which the ASR system was trained no longer matches that of the new environment. Reverberant environments can be especially harmful. We test a multi-stream system in which the constituent streams are each trained in separate environments. When training the acoustic modeling stages separately on clean data and on heavily reverberated data, we find that the combined system improves performance on unseen data.
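As an illustration of the multi-stream idea described in the abstract, the sketch below merges per-frame class posteriors from two streams, one assumed to be trained on clean data and one on heavily reverberated data. The abstract does not say which combination rule the paper uses, so the weighted geometric mean here is an assumption, and the function name `combine_posteriors` and the example posterior values are hypothetical.

```python
import math


def combine_posteriors(stream_posteriors, weights=None):
    """Combine per-class posteriors from several streams for one frame.

    Assumption: a weighted geometric mean (an average in the log domain)
    followed by renormalization. The source abstract does not specify
    the combination rule actually used.
    """
    n_streams = len(stream_posteriors)
    if weights is None:
        # Default to equal weight for every stream.
        weights = [1.0 / n_streams] * n_streams
    n_classes = len(stream_posteriors[0])
    eps = 1e-12  # floor to avoid log(0)

    log_scores = []
    for k in range(n_classes):
        score = sum(
            w * math.log(max(p[k], eps))
            for w, p in zip(weights, stream_posteriors)
        )
        log_scores.append(score)

    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_scores)
    unnormalized = [math.exp(s - top) for s in log_scores]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Hypothetical posteriors over three phone classes for a single frame.
clean_stream = [0.7, 0.2, 0.1]   # stream trained on clean speech
reverb_stream = [0.4, 0.5, 0.1]  # stream trained on reverberated speech
combined = combine_posteriors([clean_stream, reverb_stream])
print(combined)
```

In a hybrid HMM/ANN recognizer, the combined posteriors for each frame would then be passed to the decoder in place of a single stream's output. Other rules, such as a plain average of the posteriors, fit the same interface.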

References (8)

Katrin Kirchhoff, Jeff A. Bilmes. Directed graphical models of classifier combination: application to phone recognition. Conference of the International Speech Communication Association, pp. 921- (2000).

Herve A. Bourlard, Nelson Morgan. Connectionist Speech Recognition: A Hybrid Approach. Kluwer Academic Publishers (1993).

Jeff A. Bilmes, Daniel P. W. Ellis. Using mutual information to design feature combinations. International Conference on Spoken Language Processing, pp. 1-4 (2000). DOI: 10.7916/D8JQ19C8

Brian E. D. Kingsbury, Nelson Morgan, Steven Greenberg. Robust speech recognition using the modulation spectrogram. Speech Communication, vol. 25, pp. 117-132 (1998). DOI: 10.1016/S0167-6393(98)00032-6

Hynek Hermansky. Perceptual linear predictive (PLP) analysis of speech. Journal of the Acoustical Society of America, vol. 87, pp. 1738-1752 (1990). DOI: 10.1121/1.399423

H. Hermansky, N. Morgan. RASTA processing of speech. IEEE Transactions on Speech and Audio Processing, vol. 2, pp. 578-589 (1994). DOI: 10.1109/89.326616

T. Robinson, J. Christie. Time-first search for large vocabulary speech recognition. International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 829-832 (1998). DOI: 10.1109/ICASSP.1998.675393

J. Kittler, M. Hatef, R.P.W. Duin, J. Matas. On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, pp. 226-239 (1998). DOI: 10.1109/34.667881

Similar articles (4)

Combining formant frequency based on variable order LPC coding with acoustic features for TIMIT phone recognition

Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR

Multi-stream Processing for Noise Robust Speech Recognition

Acoustic feature combination for speech recognition
