Acoustic Markov models used in the Tangora speech recognition system

作者: L.R. Bahl , P.F. Brown , P.V. de Souza , M.A. Picheny

DOI: 10.1109/ICASSP.1988.196628

关键词:

摘要: The Speech Recognition Group at IBM Research has developed a real-time, isolated-word speech recognizer called Tangora, which accepts natural English sentences drawn from vocabulary of 20000 words. Despite its large vocabulary, the Tangora requires only about 20 minutes each new user for training purposes. accuracy system and ease are largely attributable to use hidden Markov models in acoustic match component. An automatic technique constructing word is described results included experiments with speaker-dependent speaker-independent on several recognition tasks. >

参考文章(18)
A. Averbuch, L. Bahl, R. Bakis, P. Brown, A. Cole, G. Daggett, S. Das, K. Davies, S. DeGennaro, P. de Souza, E. Epstein, D. Fraleigh, F. Jelinek, S. Katz, B. Lewis, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman, P. Spinelli, An IBM PC based large-vocabulary isolated-utterance speech recognizer international conference on acoustics, speech, and signal processing. ,vol. 11, pp. 53- 56 ,(1986) , 10.1109/ICASSP.1986.1169169
A. Averbuch, L. Bahl, R. Bakis, P. Brown, G. Daggett, S. Das, K. Davies, S. De Gennaro, P. de Souza, E. Epstein, D. Fraleigh, F. Jelinek, B. Lewis, R. Mercer, J. Moorhead, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman, P. Spinelli, D. Van Compernolle, H. Wilkens, Experiments with the Tangora 20,000 word speech recognizer international conference on acoustics, speech, and signal processing. ,vol. 12, pp. 701- 704 ,(1987) , 10.1109/ICASSP.1987.1169870
L. Bahl, R. Bakis, P. Cohen, A. Cole, F. Jelinek, B. Lewis, R. Mercer, Further results on the recognition of a continuously read natural corpus ICASSP '80. IEEE International Conference on Acoustics, Speech, and Signal Processing. ,vol. 5, pp. 872- 875 ,(1980) , 10.1109/ICASSP.1980.1170862
R. Schwartz, Y. Chow, S. Roucos, M. Krasner, J. Makhoul, Improved hidden Markov modeling of phonemes for continuous speech recognition international conference on acoustics, speech, and signal processing. ,vol. 9, pp. 21- 24 ,(1984) , 10.1109/ICASSP.1984.1172751
L. Bahl, R. Bakis, P. Cohen, A. Cole, F. Jelinek, B. Lewis, R. Mercer, Recognition results for several experimental acoustic processors ICASSP '79. IEEE International Conference on Acoustics, Speech, and Signal Processing. ,vol. 4, pp. 249- 251 ,(1979) , 10.1109/ICASSP.1979.1170736
Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. PAMI-5, pp. 179- 190 ,(1983) , 10.1109/TPAMI.1983.4767370
R. Bakis, Continuous speech recognition via centisecond acoustic states Journal of the Acoustical Society of America. ,vol. 59, ,(1976) , 10.1121/1.2003011
Frederick Jelinek, Continuous speech recognition by statistical methods Proceedings of the IEEE. ,vol. 64, pp. 532- 556 ,(1976) , 10.1109/PROC.1976.10159
J. Makhoul, S. Roucos, H. Gish, Vector quantization in speech coding Proceedings of the IEEE. ,vol. 73, pp. 1551- 1588 ,(1985) , 10.1109/PROC.1985.13340