Resegmentation of SWITCHBOARD.

作者: Jonathan Hamaker , Neeraj Deshmukh , Aravind Ganapathiraju , Andi Gleeson , Joseph Picone

DOI:

关键词:

摘要: The SWITCHBOARD (SWB) corpus is one of the most important benchmarks for recognition tasks involving large vocabulary conversational speech (LVCSR). high error rates on SWB are largely attributable to an acoustic model mismatch, frequency poorly articulated monosyllabic words, and variations in pronunciations. It imperative improve quality segmentations transcriptions training data achieve better modeling. By adapting existing models only a small subset such improved transcriptions, we have achieved 2% absolute improvement performance.

参考文章(1)
J.J. Godfrey, E.C. Holliman, J. McDaniel, SWITCHBOARD: telephone speech corpus for research and development international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 517- 520 ,(1992) , 10.1109/ICASSP.1992.225858