Acoustic Markov models used in the Tangora speech recognition system

Authors: L.R. Bahl, P.F. Brown, P.V. de Souza, M.A. Picheny

DOI: 10.1109/ICASSP.1988.196628

Abstract: The Speech Recognition Group at IBM Research has developed a real-time, isolated-word speech recognizer called Tangora, which accepts natural English sentences drawn from a vocabulary of 20,000 words. Despite its large vocabulary, Tangora requires only about 20 minutes from each new user for training purposes. The accuracy of the system and its ease of training are largely attributable to the use of hidden Markov models in its acoustic match component. An automatic technique for constructing word models is described, and results are included for experiments with speaker-dependent and speaker-independent models on several recognition tasks.

References (18)

A. Averbuch, L. Bahl, R. Bakis, P. Brown, A. Cole, G. Daggett, S. Das, K. Davies, S. DeGennaro, P. de Souza, E. Epstein, D. Fraleigh, F. Jelinek, S. Katz, B. Lewis, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman, P. Spinelli. An IBM PC based large-vocabulary isolated-utterance speech recognizer. International Conference on Acoustics, Speech, and Signal Processing, vol. 11, pp. 53-56 (1986). DOI: 10.1109/ICASSP.1986.1169169

L. Baum. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process. Inequalities, vol. 3, pp. 1-8 (1972).

A. Averbuch, L. Bahl, R. Bakis, P. Brown, G. Daggett, S. Das, K. Davies, S. De Gennaro, P. de Souza, E. Epstein, D. Fraleigh, F. Jelinek, B. Lewis, R. Mercer, J. Moorhead, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman, P. Spinelli, D. Van Compernolle, H. Wilkens. Experiments with the Tangora 20,000 word speech recognizer. International Conference on Acoustics, Speech, and Signal Processing, vol. 12, pp. 701-704 (1987). DOI: 10.1109/ICASSP.1987.1169870

L. Bahl, R. Bakis, P. Cohen, A. Cole, F. Jelinek, B. Lewis, R. Mercer. Further results on the recognition of a continuously read natural corpus. ICASSP '80, IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 5, pp. 872-875 (1980). DOI: 10.1109/ICASSP.1980.1170862

R. Schwartz, Y. Chow, S. Roucos, M. Krasner, J. Makhoul. Improved hidden Markov modeling of phonemes for continuous speech recognition. International Conference on Acoustics, Speech, and Signal Processing, vol. 9, pp. 21-24 (1984). DOI: 10.1109/ICASSP.1984.1172751

L. Bahl, R. Bakis, P. Cohen, A. Cole, F. Jelinek, B. Lewis, R. Mercer. Recognition results for several experimental acoustic processors. ICASSP '79, IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 4, pp. 249-251 (1979). DOI: 10.1109/ICASSP.1979.1170736

Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer. A Maximum Likelihood Approach to Continuous Speech Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5, pp. 179-190 (1983). DOI: 10.1109/TPAMI.1983.4767370

R. Bakis. Continuous speech recognition via centisecond acoustic states. Journal of the Acoustical Society of America, vol. 59 (1976). DOI: 10.1121/1.2003011

Frederick Jelinek. Continuous speech recognition by statistical methods. Proceedings of the IEEE, vol. 64, pp. 532-556 (1976). DOI: 10.1109/PROC.1976.10159

J. Makhoul, S. Roucos, H. Gish. Vector quantization in speech coding. Proceedings of the IEEE, vol. 73, pp. 1551-1588 (1985). DOI: 10.1109/PROC.1985.13340
