Asynchronous-transition HMM

作者: Shigeki Matsuda , Mitsuru Nakai , Hiroshi Shimodaira , Shigeki Sagayama

DOI: 10.1109/ICASSP.2000.859132

关键词:

摘要: We propose a new class of hidden Markov model (HMM) called asynchronous-transition HMM (AT-HMM). Opposed to conventional HMMs where state transition occurs simultaneously all features, the allows transitions asynchronized between individual features better asynchronous timings acoustic feature changes. In this paper, we focus on particular AT-HMM with sequential constraints based novel concept "state tying along time". To maximize advantage model, also introduce feature-wise technique. Speaker-dependent speech recognition experiments demonstrated error reduction rates more than 30% and 50% in phoneme isolated word recognitions, respectively, compared HMMs.

参考文章(6)
S. Takahashi, S. Sagayama, Four-level tied-structure for efficient representation of acoustic modeling international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 520- 523 ,(1995) , 10.1109/ICASSP.1995.479643
S. Sagayama, Asynchronous-Transition HMM for Acoustic Modeling international conference on acoustics, speech, and signal processing. pp. 1001- 1004 ,(2000)
J.R. Bellegarda, D. Nahamoo, Tied mixture continuous parameter models for large vocabulary isolated speech recognition international conference on acoustics, speech, and signal processing. pp. 13- 16 ,(1989) , 10.1109/ICASSP.1989.266351
X.D. Huang, K.F. Lee, H.W. Hon, M.Y. Hwang, Improved acoustic modeling with the SPHINX speech recognition system international conference on acoustics, speech, and signal processing. pp. 345- 348 ,(1991) , 10.1109/ICASSP.1991.150347
J. Takami, S. Sagayama, A successive state splitting algorithm for efficient allophone modeling international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 573- 576 ,(1992) , 10.1109/ICASSP.1992.225855
M. Ostendorf, H. Singer, HMM topology design using maximum likelihood successive state splitting Computer Speech & Language. ,vol. 11, pp. 17- 41 ,(1997) , 10.1006/CSLA.1996.0021