Multi-stream ASR trained with heterogeneous reverberant environments

作者: M.L. Shire

DOI: 10.1109/ICASSP.2001.940815

关键词: Computer scienceReverberationFeature (machine learning)Pattern recognitionMulti streamArtificial intelligenceAcoustic modelSpeech recognitionHidden Markov model

摘要: A common problem with automatic speech recognition (ASR) systems is that the performance degrades when it presented from a different acoustic environment than one used during training. An important cause feature distribution to which ASR system trained no longer matches of new environment. Reverberant environments can be especially harmful. We test multi-stream in constituent streams are each separate environments. When training modeling stages separately clean data and heavily reverberated data, we find combined improve unseen data.

参考文章(8)
Katrin Kirchhoff, Jeff A. Bilmes, Directed graphical models of classifier combination: application to phone recognition. conference of the international speech communication association. pp. 921- ,(2000)
Herve A. Bourlard, Nelson Morgan, Connectionist Speech Recognition: A Hybrid Approach Kluwer Academic Publishers. ,(1993)
Jeff A. Bilmes, Daniel P. W. Ellis, Using mutual information to design feature combinations international conference on spoken language processing. pp. 1- 4 ,(2000) , 10.7916/D8JQ19C8
Brian E.D Kingsbury, Nelson Morgan, Steven Greenberg, Robust speech recognition using the modulation spectrogram Speech Communication. ,vol. 25, pp. 117- 132 ,(1998) , 10.1016/S0167-6393(98)00032-6
Hynek Hermansky, Perceptual linear predictive (PLP) analysis of speech Journal of the Acoustical Society of America. ,vol. 87, pp. 1738- 1752 ,(1990) , 10.1121/1.399423
H. Hermansky, N. Morgan, RASTA processing of speech IEEE Transactions on Speech and Audio Processing. ,vol. 2, pp. 578- 589 ,(1994) , 10.1109/89.326616
T. Robinson, J. Christie, Time-first search for large vocabulary speech recognition international conference on acoustics speech and signal processing. ,vol. 2, pp. 829- 832 ,(1998) , 10.1109/ICASSP.1998.675393
J. Kittler, M. Hatef, R.P.W. Duin, J. Matas, On combining classifiers IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 20, pp. 226- 239 ,(1998) , 10.1109/34.667881