Recognition results for several experimental acoustic processors

作者: L. Bahl , R. Bakis , P. Cohen , A. Cole , F. Jelinek

DOI: 10.1109/ICASSP.1979.1170736

关键词:

摘要: The statistical training and decoding procedures developed at IBM Research can be used with a wide variety of acoustic processors. We have recently (July August 1978) achieved error-free or nearly results several different processors on sentences from the New Raleigh Language (vocabulary 250 words, perplexity 7.27 words). One these processors, which 0% error rate sentences, has been to decode without benefit syntactic guidance during process. On this much more difficult task, it an 8.8% word level, corresponding sentence 53%. All are non-segmenting produce output once every 10ms.

参考文章(8)
L. Bahl, J. Baker, P. Cohen, A. Cole, F. Jelinek, B. Lewis, R. Mercer, Automatic recognition of continuously spoken sentences from a finite state grammer ICASSP '78. IEEE International Conference on Acoustics, Speech, and Signal Processing. ,vol. 3, pp. 418- 421 ,(1978) , 10.1109/ICASSP.1978.1170404
W. Klein, R. Plomp, L. C. W. Pols, Vowel Spectra, Vowel Spaces, and Vowel Identification The Journal of the Acoustical Society of America. ,vol. 48, pp. 999- 1009 ,(1970) , 10.1121/1.1912239
R. Bakis, Continuous speech recognition via centisecond acoustic states Journal of the Acoustical Society of America. ,vol. 59, ,(1976) , 10.1121/1.2003011
Frederick Jelinek, Continuous speech recognition by statistical methods Proceedings of the IEEE. ,vol. 64, pp. 532- 556 ,(1976) , 10.1109/PROC.1976.10159
N. Dixon, H. Silverman, The 1976 modular acoustic processor(MAP) IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 25, pp. 367- 379 ,(1977) , 10.1109/TASSP.1977.1162985
F. Jelinek, R. L. Mercer, L. R. Bahl, J. K. Baker, Perplexity—a measure of the difficulty of speech recognition tasks Journal of the Acoustical Society of America. ,vol. 62, ,(1977) , 10.1121/1.2016299
F. Jelinek, L. Bahl, R. Mercer, Design of a linguistic statistical decoder for the recognition of continuous speech IEEE Transactions on Information Theory. ,vol. 21, pp. 250- 256 ,(1975) , 10.1109/TIT.1975.1055384
L. Bahl, F. Jelinek, Decoding for channels with insertions, deletions, and substitutions with applications to speech recognition IEEE Transactions on Information Theory. ,vol. 21, pp. 404- 411 ,(1975) , 10.1109/TIT.1975.1055419