The development of an experimental discrete dictation recognizer

作者: F. Jelinek

DOI: 10.1109/PROC.1985.13343

关键词:

摘要: This paper describes an experimental real-time recognizer of isolated word dictation implemented at the IBM Thomas J. Watson Research Center, on a system commercially available computers and array processors. The recognizer's intended use is creation office memoranda. It based 5000-word vocabulary. A specially designed workstation enables user to correct edit transcribed speech. outlines self-organized, statistical approach underlying basic algorithms recognizer. Results several recognition experiments are then presented. rest considers important issues in future development recognizers, such as vocabulary selection, language model creation, human factors.

参考文章(19)
A. Nadas, R. Mercer, L. Bahl, R. Bakis, P. Cohen, A. Cole, F. Jelinek, B. Lewis, Continuous speech recognition with automatically selected acoustic prototypes obtained by either bootstrapping or clustering international conference on acoustics, speech, and signal processing. ,vol. 6, pp. 1153- 1155 ,(1981) , 10.1109/ICASSP.1981.1171177
D. Huttenlocher, V. Zue, A model of lexical access from partial phonetic information international conference on acoustics, speech, and signal processing. ,vol. 9, pp. 391- 394 ,(1984) , 10.1109/ICASSP.1984.1172525
Lalit R. Bahl, Frederick Jelinek, Robert L. Mercer, A Maximum Likelihood Approach to Continuous Speech Recognition IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. PAMI-5, pp. 179- 190 ,(1983) , 10.1109/TPAMI.1983.4767370
H. Abut, R. Gray, G. Rebolledo, Vector quantization of speech and speech-like waveforms IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 30, pp. 423- 435 ,(1982) , 10.1109/TASSP.1982.1163907
Frederick Jelinek, Continuous speech recognition by statistical methods Proceedings of the IEEE. ,vol. 64, pp. 532- 556 ,(1976) , 10.1109/PROC.1976.10159
John Bissell Carroll, Barry Richman, Peter Davies, The American heritage word frequency book ,(1971)
J. D. GOULD, S. J. BOIES, Writing, Dictating, and Speaking Letters Science. ,vol. 201, pp. 1145- 1147 ,(1978) , 10.1126/SCIENCE.201.4361.1145
John D. Gould, Stephen J. Boies, Human factors challenges in creating a principal support office system—the speech filing system approach ACM Transactions on Information Systems. ,vol. 1, pp. 273- 298 ,(1983) , 10.1145/357442.357443