Word duration modeling for word graph rescoring in LVCSR.

Authors: Daniele Falavigna, Dino Seppi, Georg Stemmer, Roberto Gretter

References (15)
F. Brugnara, D. Falavigna, D. Seppi, G. Stemmer, R. Gretter, D. Pineda, D. Giuliani, The ITC-irst transcription systems for the TC-STAR-06 evaluation campaign (2006)
M. Hasegawa-Johnson, K. Chen, Prosody dependent speech recognition on radio news (2003)
Jing Zheng, H. Franco, Fuliang Weng, A. Sankar, H. Bratt, Word-level rate of speech modeling using rate-specific phones and pronunciations. International Conference on Acoustics, Speech, and Signal Processing, vol. 3, pp. 1775-1778 (2000), 10.1109/ICASSP.2000.862097
Andreas Stolcke, Elizabeth Shriberg, Dilek Z. Hakkani-Tür, Gökhan Tür, Modeling the prosody of hidden events for improved word recognition. Conference of the International Speech Communication Association (1999)
Hervé Bourlard, Hynek Hermansky, Nelson Morgan, Towards increasing speech recognition error rates. Speech Communication, vol. 18, pp. 205-231 (1996), 10.1016/0167-6393(96)00003-9
Mikko Kurimo, Janne Pylkkönen, Duration modeling techniques for continuous speech recognition. Conference of the International Speech Communication Association (2004)
L. Fenton, The sum of log-normal probability distributions in scatter transmission systems. IEEE Transactions on Communications, vol. 8, pp. 57-67 (1960), 10.1109/TCOM.1960.1097606
B. Juang, L. Rabiner, S. Levinson, M. Sondhi, Recent developments in the application of hidden Markov models to speaker-independent isolated word recognition. International Conference on Acoustics, Speech, and Signal Processing, vol. 10, pp. 9-12 (1985), 10.1109/ICASSP.1985.1168453