Phoneme based speech recognition

Author: Vishwa N. Gupta

Abstract: A flexible vocabulary speech recognition system is provided for recognizing speech transmitted via the public switched telephone network. The flexible vocabulary recognition (FVR) system is a phoneme based system. The phonemes are modelled as hidden Markov models. The vocabulary is represented as concatenated phoneme models. The phoneme models are trained using Viterbi training enhanced by: substituting the covariance matrix of given phonemes by others, applying energy level thresholds and voiced, unvoiced, silence labelling constraints during Viterbi training. Specific vocabulary members, such as digits, are represented by allophone models. A* searching of the lexical network is facilitated by providing a reduced network which provides estimate scores used to evaluate the recognition path through the lexical network. Joint recognition and rejection of out-of-vocabulary words are provided by using both cepstrum and LSP parameter vectors.

References (13)

L. Deng, V. Gupta, M. Lennig, P. Kenny, P. Mermelstein, Acoustic recognition component of an 86000-word speech recognizer. International Conference on Acoustics, Speech, and Signal Processing, pp. 741-744 (1990). doi:10.1109/ICASSP.1990.115896

Laurence Gillick, Method for representing word models for use in speech recognition. Journal of the Acoustical Society of America, vol. 92, p. 629 (1989). doi:10.1121/1.404106

Frederick Jelinek, Continuous speech recognition by statistical methods. Proceedings of the IEEE, vol. 64, pp. 532-556 (1976). doi:10.1109/PROC.1976.10159

Yunxin Zhao, Training module for estimating mixture Gaussian densities for speech unit models in speech recognition systems. Journal of the Acoustical Society of America, vol. 95, p. 2303 (1990). doi:10.1121/1.408599

Apparatus and method of grouping utterances of a phoneme into context-dependent categories based on sound-similarity for automatic speech recognition. Journal of the Acoustical Society of America, vol. 95, p. 3688 (1992). doi:10.1121/1.409891

James K. Baker, Laurence Gillick, Method for speech recognition. Journal of the Acoustical Society of America, vol. 88, p. 1672 (1987). doi:10.1121/1.400232

Stephen E. Levinson, Hidden Markov model speech recognition arrangement. Journal of the Acoustical Society of America, vol. 86, p. 2478 (1982). doi:10.1121/1.398355

Peter F. Brown, Automatic determination of labels and Markov word models in a speech recognition system. Journal of the Acoustical Society of America, vol. 93, p. 3542 (1988). doi:10.1121/1.405348

M. Lennig, Putting speech recognition to work in the telephone network. IEEE Computer, vol. 23, pp. 35-41 (1990). doi:10.1109/2.56869

L. Rabiner, M. Sambur, Application of an LPC distance measure to the voiced-unvoiced-silence detection problem. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 25, pp. 338-343 (1977). doi:10.1109/TASSP.1977.1162964
