Experiments with the Tangora 20,000 word speech recognizer

Authors: A. Averbuch, L. Bahl, R. Bakis, P. Brown, G. Daggett

DOI: 10.1109/ICASSP.1987.1169870

Abstract: The Speech Recognition Group at IBM Research in Yorktown Heights has developed a real-time, isolated-utterance speech recognizer for natural language based on the Personal Computer AT and Signal Processors. The system has recently been enhanced by expanding the vocabulary from 5,000 words to 20,000 words and by the addition of a workstation to support usability studies on document creation by voice. The system supports spelling and interactive personalization to augment the vocabularies. This paper describes the implementation, user interface, and comparative performance of the recognizer.

References (10)
A. Averbuch, L. Bahl, R. Bakis, P. Brown, A. Cole, G. Daggett, S. Das, K. Davies, S. DeGennaro, P. de Souza, E. Epstein, D. Fraleigh, F. Jelinek, S. Katz, B. Lewis, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny, G. Shichman, P. Spinelli, "An IBM PC based large-vocabulary isolated-utterance speech recognizer," International Conference on Acoustics, Speech, and Signal Processing, vol. 11, pp. 53-56 (1986), DOI: 10.1109/ICASSP.1986.1169169
L. Bahl, R. Bakis, P. Cohen, A. Cole, F. Jelinek, B. Lewis, R. Mercer, "Speech recognition of a natural text read as isolated words," International Conference on Acoustics, Speech, and Signal Processing, vol. 6, pp. 1168-1171 (1981), DOI: 10.1109/ICASSP.1981.1171115
Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer, "A Maximum Likelihood Approach to Continuous Speech Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-5, pp. 179-190 (1983), DOI: 10.1109/TPAMI.1983.4767370
Frederick Jelinek, "Continuous speech recognition by statistical methods," Proceedings of the IEEE, vol. 64, pp. 532-556 (1976), DOI: 10.1109/PROC.1976.10159
Jordan R. Cohen, "Application of an adaptive auditory model to speech recognition," Journal of the Acoustical Society of America, vol. 78 (1985), DOI: 10.1121/1.2022857
G. Shichman, "Personal instrument (PI)--A PC-based signal processing system," IBM Journal of Research and Development, vol. 29, pp. 158-169 (1985), DOI: 10.1147/RD.292.0158
S. Katz, "Estimation of probabilities from sparse data for the language model component of a speech recognizer," IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 35, pp. 400-401 (1987), DOI: 10.1109/TASSP.1987.1165125
F. Jelinek, L. Bahl, R. Mercer, "Design of a linguistic statistical decoder for the recognition of continuous speech," IEEE Transactions on Information Theory, vol. 21, pp. 250-256 (1975), DOI: 10.1109/TIT.1975.1055384
F. Jelinek, "The development of an experimental discrete dictation recognizer," Proceedings of the IEEE, vol. 73, pp. 587-595 (1985), DOI: 10.1109/PROC.1985.13343
F. Jelinek, "A real-time, isolated-word, speech recognition system for dictation transcription," ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 10, pp. 858-861 (1985), DOI: 10.1109/ICASSP.1985.1168313