Improved hidden Markov modeling for speaker-independent continuous speech recognition

Authors: Xuedong Huang, Fil Alleva, Satoru Hayamizu, Hsiao-Wuen Hon, Mei-Yuh Hwang

DOI: 10.3115/116580.116687

Abstract: The paper reports recent efforts to further improve the performance of the Sphinx system for speaker-independent continuous speech recognition. The recognition error rate is significantly reduced with the incorporation of additional dynamic features, semi-continuous hidden Markov models, and speaker clustering. For the June 1990 (RM2) evaluation test set, the error rates of our current system are 4.3% and 19.9% for the word-pair grammar and no grammar, respectively.

References (18)
Kai-Fu Lee, Hidden Markov models: past, present, and future. Conference of the International Speech Communication Association, pp. 1148-1155 (1989).
Raj Reddy, Kai-Fu Lee, Automatic Speech Recognition: The Development of the Sphinx Recognition System. Kluwer Academic Publishers (1988).
M.J. Russell, K.M. Ponting, S.M. Peeling, S.R. Browning, J.S. Bridle, R.K. Moore, I. Galiano, P. Howell, The ARM continuous speech recognition system. International Conference on Acoustics, Speech, and Signal Processing, pp. 69-72 (1990). doi:10.1109/ICASSP.1990.115539
Hsiao-Wuen Hon, Satoru Hayamizu, Kai-Fu Lee, Description of acoustic variations by hidden Markov models with tree structure (1990).
Raj Reddy, Kai-Fu Lee, Large-vocabulary speaker-independent continuous speech recognition: the Sphinx system. Carnegie Mellon University (1988).
G.R. Doddington, Phonetically sensitive discriminants for improved speech recognition. International Conference on Acoustics, Speech, and Signal Processing, pp. 556-559 (1989). doi:10.1109/ICASSP.1989.266487
S. Furui, On the use of hierarchical spectral dynamics in speech recognition. International Conference on Acoustics, Speech, and Signal Processing, pp. 789-792 (1990). doi:10.1109/ICASSP.1990.115927
S. Furui, Speaker-independent isolated word recognition using dynamic features of speech spectrum. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 34, pp. 52-59 (1986). doi:10.1109/TASSP.1986.1164788
L. R. Rabiner, B. H. Juang, Hidden Markov models for speech recognition. Technometrics, vol. 33, pp. 251-272 (1991). doi:10.2307/1268779