Robust automatic time alignment of orthographic transcriptions with unconstrained speech

作者: B. Wheatley , G. Doddington , C. Hemphill , J. Godfrey , E. Holliman

DOI: 10.1109/ICASSP.1992.225853

关键词:

摘要: A method for automatic time alignment of orthographically transcribed speech using supervised speaker-independent recognition based on the orthographic transcription, an online dictionary, and HMM phone models is presented. This successfully aligns transcriptions with in unconstrained 5 to 10 min conversations collected over long-distance telephone lines. It requires minimal manual processing generally produces correct alignments despite challenging nature data. The robustness efficiency make it a practical tool very large corpora. >

参考文章(1)
J.J. Godfrey, E.C. Holliman, J. McDaniel, SWITCHBOARD: telephone speech corpus for research and development international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 517- 520 ,(1992) , 10.1109/ICASSP.1992.225858