Senone tree representation and evaluation

作者: Finelo A. Alleva

DOI: 10.1121/1.426676

关键词:

摘要: A speech recognition method provides improved modeling in accuracy using hidden Markov models. During training, the creates a senone tree for each state of phoneme encountered data set training words. All output distributions received selected words are clustered together root node tree. Each beginning with is divided into two nodes by asking linguistic questions regarding phonemes immediately to left and right central triphone. At predetermined point, creation stops, resulting leaves representing known as senones. The trees allow all possible triphones be mapped sequence senones simply traversing associated As result, unseen not can modeled created actually found data.

参考文章(17)
J. Takami, S. Sagayama, A successive state splitting algorithm for efficient allophone modeling international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 573- 576 ,(1992) , 10.1109/ICASSP.1992.225855
James K. Baker, Interactive speech recognition apparatus Journal of the Acoustical Society of America. ,vol. 91, pp. 546- 546 ,(1986) , 10.1121/1.402658
Jed M. Roberts, Speech detection and recognition apparatus for use with background noise of varying levels Journal of the Acoustical Society of America. ,vol. 89, pp. 3026- 3026 ,(1986) , 10.1121/1.400824
Gerarld Moese, Computer system for speech recognition Journal of the Acoustical Society of America. ,vol. 99, pp. 646- ,(1992) , 10.1121/1.414609
Akihiro Kuroda, Speech recognition method Journal of the Acoustical Society of America. ,vol. 94, pp. 3538- 3538 ,(1987) , 10.1121/1.400821
Jed Roberts, Method for interactive speech recognition and training Journal of the Acoustical Society of America. ,vol. 93, pp. 2258- 2258 ,(1988) , 10.1121/1.406635
Lalit R. Bahl, Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system Journal of the Acoustical Society of America. ,vol. 93, pp. 3541- 3541 ,(1990) , 10.1121/1.405346
M.-Y. Hwang, X. Huang, F. Alleva, Predicting unseen triphones with senones IEEE International Conference on Acoustics Speech and Signal Processing. ,vol. 2, pp. 311- 314 ,(1993) , 10.1109/ICASSP.1993.319299
H.-W. Hon, K.-F. Lee, CMU robust vocabulary-independent speech recognition system international conference on acoustics, speech, and signal processing. pp. 889- 892 ,(1991) , 10.1109/ICASSP.1991.150482
Raimo Bakis, Optimized speech recognition system and method Journal of the Acoustical Society of America. ,vol. 94, pp. 3538- 3538 ,(1990) , 10.1121/1.407134