A study of irrelevant variability normalization based training and unsupervised online adaptation for LVCSR.

作者: Yu Shi , Qiang Huo , Guangchuan Shi

DOI:

关键词: Computer scienceVocabularyNormalization (statistics)Word error rateOnline adaptationSpeech recognition

摘要: This paper presents an experimental study of a maximum likelihood (ML) approach to irrelevant variability normalization (IVN) based training and unsupervised online adaptation for large vocabulary continuous speech recognition. A moving window frame labeling method is used acoustic sniffing. The IVN-based achieves 10% relative word error rate reduction over ML-trained baseline system on Switchboard-1 conversational telephone transcription task.

参考文章(1)
J.J. Godfrey, E.C. Holliman, J. McDaniel, SWITCHBOARD: telephone speech corpus for research and development international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 517- 520 ,(1992) , 10.1109/ICASSP.1992.225858