Extended Viterbi algorithm for optimized word HMMs

Authors: Michael Gerber, Tobias Kaufmann, Beat Pfister

DOI: 10.1109/ICASSP.2011.5947462

Abstract: This paper deals with the problem of finding the optimal sequence of sub-word unit HMMs for a number of given utterances of a word. For this problem we present a new solution based on an extension of the Viterbi algorithm, which maximizes the joint probability of the utterances over all possible sequences of sub-word units and hence guarantees to find the optimal solution. The new algorithm was applied in an isolated word recognition experiment and compared to simpler approaches for determining the sequence of sub-word units. We report a significant reduction of the recognition error rate with the new algorithm.

References (8)
Maximilian Bisani, Hermann Ney, Breadth-first search for finding the optimal phonetic transcription from multiple utterances. Conference of the International Speech Communication Association. pp. 1429-1432 (2001)
Torbjørn Svendsen, Frank K. Soong, Heiko Purnhagen, Optimizing baseforms for HMM-based speech recognition. Conference of the International Speech Communication Association. (1995)
Beat Pfister, Michael Gerber, Fast search for common segments in speech signals for speaker verification. Conference of the International Speech Communication Association. pp. 375-378 (2008)
L. Gillick, S.J. Cox, Some statistical issues in the comparison of speech recognition algorithms. International Conference on Acoustics, Speech, and Signal Processing. pp. 532-535 (1989). DOI: 10.1109/ICASSP.1989.266481
F.K. Soong, E.-F. Huang, A tree-trellis based fast search for finding the N-best sentence hypotheses in continuous speech recognition. International Conference on Acoustics, Speech, and Signal Processing. pp. 705-708 (1991). DOI: 10.1109/ICASSP.1991.150437
L.R. Bahl, P.F. Brown, P.V. de Souza, R.L. Mercer, M.A. Picheny, A method for the construction of acoustic Markov models for words. IEEE Transactions on Speech and Audio Processing. vol. 1, pp. 443-452 (1993). DOI: 10.1109/89.242490
Jianxiong Wu, Vishwa Gupta, Application of simultaneous decoding algorithms to automatic transcription of known and unknown words. International Conference on Acoustics, Speech, and Signal Processing. vol. 2, pp. 589-592 (1999). DOI: 10.1109/ICASSP.1999.759735