Speech recognition system for recognizing continuous and isolated speech

作者: Fileno A Alleva , Mei-Yuh Hwang , Xuedong D Huang , Li Jiang

DOI:

关键词: Speech recognitionAcoustic modelSpeech processingComputer scienceSpeech trainingAudio mining

摘要: Speech recognition is performed by receiving isolated speech training data (step 98) indicative of a plurality discretely spoken words, and continuous 86) continuously words. A unit models trained based on the data. recognized trained.

参考文章(12)
Masao C, Hiroaki C, Hiromi C, Continuous word recognition system ,(1986)
Shoji Kuriki, Speech recognition apparatus including speaker-independent dictionary and speaker-dependent Journal of the Acoustical Society of America. ,vol. 95, pp. 1185- 1185 ,(1990) , 10.1121/1.408411
Yunxin Zhao, Training module for estimating mixture Gaussian densities for speech unit models in speech recognition systems Journal of the Acoustical Society of America. ,vol. 95, pp. 2303- 2303 ,(1990) , 10.1121/1.408599
George R. Doddington, Michael L. McMahan, Connected word recognition enrollment method Journal of the Acoustical Society of America. ,vol. 89, pp. 491- 491 ,(1986) , 10.1121/1.400437
Method of optimizing a composite speech recognition expert Journal of the Acoustical Society of America. ,vol. 96, pp. 3834- 3834 ,(1991) , 10.1121/1.410493
Yunxin Zhao, Self-learning speaker adaptation based on spectral bias source decomposition, using very short calibration speech Journal of the Acoustical Society of America. ,vol. 105, pp. 1450- ,(1996) , 10.1121/1.426675
Jie Yi, Speech recognition method and system using triphones, diphones, and phonemes Journal of the Acoustical Society of America. ,vol. 100, pp. 3488- ,(1992) , 10.1121/1.417271
Xuedong Huang, A. Acero, F. Alleva, Mei-Yuh Hwang, Li Jiang, M. Mahajan, Microsoft Windows highly intelligent speech recognizer: Whisper international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 93- 96 ,(1995) , 10.1109/ICASSP.1995.479281
Mei-Yuh Hwang, Xuedong Huang, F.A. Alleva, Predicting unseen triphones with senones IEEE Transactions on Speech and Audio Processing. ,vol. 4, pp. 412- 419 ,(1996) , 10.1109/89.544526