Machine Recognition of Spoken Words

作者: Richard Fatehchand

DOI: 10.1016/S0065-2458(08)60609-9

关键词: Speech processingVisemeSpeech productionComputer scienceArtificial intelligenceSpeech analyticsAcoustic modelSpeech recognitionSpeaker recognitionAudio miningSpeech corpusNatural language processing

摘要: Publisher Summary The mechanical recognition of speech sounds is a field in which computers are now being used. This chapter discusses the present state by machines. Speech machines must work with acoustic wave as input, and therefore perform some or all processes normally province human listener. classifies various describes their properties. classification done on an articulatory basis. In addition, examines phonetic transcription speech, it indicates economical method for design cannot be treated independently occurring events. methods used machine discussed vowels, vowel-like sounds, fricatives, plosives, binary method. Present devices operate correctly only if speaker controls his/her pronunciation within narrow limits. Contemporary at classified (1) accurate any word out small group (subdivided into (a) that deal each unit (b) break words least approximately phonemic segments), (2) Machines designed to recognize large number different words. have little tolerance changes, self-adjustment features would appear necessary. Although there present, idea how this might current self-optimizing learning may prove relevant.

参考文章(34)
George A. Miller, Patricia E. Nicely, An Analysis of Perceptual Confusions Among Some English Consonants The Journal of the Acoustical Society of America. ,vol. 27, pp. 338- 352 ,(1955) , 10.1121/1.1907526
Peter Ladefoged, D. E. Broadbent, Information Conveyed by Vowels Journal of the Acoustical Society of America. ,vol. 29, pp. 98- 104 ,(1957) , 10.1121/1.1908694
R. K. Potter, J. C. Steinberg, Toward the Specification of Speech Journal of the Acoustical Society of America. ,vol. 22, pp. 807- 820 ,(1950) , 10.1121/1.1906694
Eugene Peterson, Franklin S. Cooper, Peakpicker: A Band‐Width Compression Device The Journal of the Acoustical Society of America. ,vol. 29, pp. 777- 777 ,(1957) , 10.1121/1.1918863
Carma Forgie, M. L. Groves, F. C. Frick, Automatic Recognition of Spoken Digits Journal of the Acoustical Society of America. ,vol. 30, pp. 669- 669 ,(1958) , 10.1121/1.1929935
George A. Miller, George A. Heise, William Lichten, The intelligibility of speech as a function of the context of the test materials. Journal of Experimental Psychology. ,vol. 41, pp. 329- 335 ,(1951) , 10.1037/H0062491
George W Hughes, Morris Halle, None, Spectral Properties of Fricative Consonants The Journal of the Acoustical Society of America. ,vol. 28, pp. 303- 310 ,(1956) , 10.1121/1.1908271
Lawrence Johnson, Irwin Pollack, Reproduction and Identification of Elements of Auditory Displays Journal of the Acoustical Society of America. ,vol. 30, pp. 673- 674 ,(1958) , 10.1121/1.1929959
M. Halle, G. W. Hughes, J.‐P. A. Radley, Acoustic Properties of Stop Consonants The Journal of the Acoustical Society of America. ,vol. 29, pp. 107- 116 ,(1957) , 10.1121/1.1908634
Carl Becker, Application of Nonlinear Vibration Isolators The Journal of the Acoustical Society of America. ,vol. 29, pp. 776- 776 ,(1957) , 10.1121/1.1918857