Phoneme based speech recognition

Author: Vishwa N. Gupta

DOI: 10.1121/1.413457

Abstract: A flexible vocabulary speech recognition system is provided for recognizing speech transmitted via the public switched telephone network. The flexible vocabulary recognition (FVR) system is a phoneme based system. The phonemes are modelled as hidden Markov models. The vocabulary is represented as concatenated phoneme models. The phoneme models are trained using Viterbi training enhanced by: substituting the covariance matrix of given phonemes by others, applying energy level thresholds, and applying voiced, unvoiced, silence labelling constraints during Viterbi training. Specific vocabulary members, such as digits, are represented by allophone models. A* searching of the lexical network is facilitated by providing a reduced network which provides estimate scores used to evaluate the recognition path through the lexical network. Joint recognition and rejection of out-of-vocabulary words are provided by using both cepstrum and LSP parameter vectors.

References (13)
L. Deng, V. Gupta, M. Lennig, P. Kenny, P. Mermelstein. Acoustic recognition component of an 86000-word speech recognizer. International Conference on Acoustics, Speech, and Signal Processing, pp. 741-744 (1990). DOI: 10.1109/ICASSP.1990.115896
Laurence Gillick. Method for representing word models for use in speech recognition. Journal of the Acoustical Society of America, vol. 92, p. 629 (1989). DOI: 10.1121/1.404106
Frederick Jelinek. Continuous speech recognition by statistical methods. Proceedings of the IEEE, vol. 64, pp. 532-556 (1976). DOI: 10.1109/PROC.1976.10159
Yunxin Zhao. Training module for estimating mixture Gaussian densities for speech unit models in speech recognition systems. Journal of the Acoustical Society of America, vol. 95, p. 2303 (1990). DOI: 10.1121/1.408599
James K. Baker, Laurence Gillick. Method for speech recognition. Journal of the Acoustical Society of America, vol. 88, p. 1672 (1987). DOI: 10.1121/1.400232
Stephen E. Levinson. Hidden Markov model speech recognition arrangement. Journal of the Acoustical Society of America, vol. 86, p. 2478 (1982). DOI: 10.1121/1.398355
Peter F. Brown. Automatic determination of labels and Markov word models in a speech recognition system. Journal of the Acoustical Society of America, vol. 93, p. 3542 (1988). DOI: 10.1121/1.405348
M. Lennig. Putting speech recognition to work in the telephone network. IEEE Computer, vol. 23, pp. 35-41 (1990). DOI: 10.1109/2.56869
L. Rabiner, M. Sambur. Application of an LPC distance measure to the voiced-unvoiced-silence detection problem. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 25, pp. 338-343 (1977). DOI: 10.1109/TASSP.1977.1162964