Different strategies for distribution clustering using discrete, semicontinuous and continuous HMMs in CSR

作者: R. de Cordoba , J.M. Pardo

DOI: 10.1109/ICSLP.1996.607798

关键词: InterpolationContext modelLoudspeakerHidden Markov modelError reductionArtificial intelligenceRobustness (computer science)SmoothingPattern recognitionComputer scienceCluster analysis

摘要: The authors present an overview of different strategies and refinements to share parameters in HMM models at distribution (state) level for continuous speech recognition, showing the advantages drawbacks kinds modeling. They compare them with sharing model level, achieving error reduction close 20%. Discrete, semicontinuous are also compared using these approaches. consider two ways smooth discrete distributions (interpolate detailed context dependent robust independent) derived from deleted interpolation co-occurrence smoothing.

参考文章(7)
José Colás, Ricardo de Córdoba, José Manuel Pardo, Improving and optimizing speaker independent, 1000 words speech recognition in Spanish. conference of the international speech communication association. ,(1992)
Mei-Yuh Hwang, Hsiao-Wuen Hon, Kai-Fu Lee, Modeling between-word coarticulation in continuous speech recognition. conference of the international speech communication association. pp. 1005- 1008 ,(1989)
J. Ferreiros, R. de Córdoba, M. H. Savoji, J. M. Pardo, Continuous Speech HMM Training System: Applications to Speech Recognition and Phonetic Label Alignment Springer Berlin Heidelberg. pp. 68- 71 ,(1995) , 10.1007/978-3-642-57745-1_8
Javier Macías Guarasa, Ricardo de Córdoba, Xavier Menéndez-Pidal, José Manuel Pardo, Ascensión Gallardo-Antolín, Development and improvement of a real-time ASR system for isolated digits in Spanish over the telephone line. conference of the international speech communication association. ,(1995)
Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. PAMI-5, pp. 179- 190 ,(1983) , 10.1109/TPAMI.1983.4767370
X.D. Huang, H.W. Hon, M.Y. Hwang, K.F. Lee, A comparative study of discrete, semicontinuous, and continuous hidden Markov models Computer Speech & Language. ,vol. 7, pp. 359- 368 ,(1993) , 10.1006/CSLA.1993.1019
Mei‐Yuh Hwang, Hsiao‐Wuen Hon, Kai‐Fu Lee, Interword coarticulation modeling for continuous speech recognition Journal of the Acoustical Society of America. ,vol. 85, ,(1989) , 10.1121/1.2026700