Spectral Distribution of Prosodic Information

作者: Ken W. Grant , Brian E. Walden

DOI: 10.1044/JSHR.3902.228

关键词:

摘要: Prosodic speech cues for rhythm, stress, and intonation are related primarily to variations in intensity, duration, and fundamental frequency. Because these cues make use of temporal properties of the speech waveform they are likely to be represented broadly across the speech spectrum. In order to determine the relative importance of different frequency regions for the recognition of Prosodic cues, identification of four Prosodic features, syllable number, syllabic stress, sentence intonation, and phrase boundary location, was evaluated under six filter conditions spanning the range from 200–6100 Hz. Each filter condition had equal articulation index (Al) weights, Al ½ 0.10; p(C) isolated words ≈ 0.40. Results obtained with normally hearing subjects showed that there was an interaction between filter condition and the identification of specific Prosodic features. For example, information from high-frequency regions of speech was particularly useful in the identification of syllable number and stress, whereas information from low-frequency regions was helpful in identifying intonation patterns. In spite of these spectral differences, overall listeners performed remarkably well in identifying Prosodic patterns, although individual differences were apparent. For some subjects, equivalent levels of performance across the six filter conditions were achieved. These results are discussed in relation to auditory and auditory-visual speech recognition.

参考文章(50)
Quentin Summerfield, Some preliminaries to a comprehensive account of audio-visual speech perception. Lawrence Erlbaum Associates, Inc. ,(1987)
Robert S. C. Cowan, Lesley A. Whitford, Joseph I. Alcantara, Peter J. Blamey, Graeme M. Clark, Speech perception using combinations of auditory, visual, and tactile information Journal of Rehabilitation Research and Development. ,vol. 26, pp. 15- 24 ,(1989)
Charles Read, Raymond D. Kent, The acoustic analysis of speech Singular/Thomson Learning. ,(1992)
Nancy S. McGarr, Patricia M. Hargrove, Prosody Management of Communication Disorders ,(1994)
Frank C. Merewether, Murray Alpert, The components and neuroanatomic bases of prosody Journal of Communication Disorders. ,vol. 23, pp. 325- 336 ,(1990) , 10.1016/0021-9924(90)90007-L
William M. Christie, Some cues for syllable juncture perception in English The Journal of the Acoustical Society of America. ,vol. 55, pp. 819- 821 ,(1974) , 10.1121/1.1914606
George A. Miller, Patricia E. Nicely, An Analysis of Perceptual Confusions Among Some English Consonants The Journal of the Acoustical Society of America. ,vol. 27, pp. 338- 352 ,(1955) , 10.1121/1.1907526
Stuart Michael Rosen, AJ Fourcin, Brian CJ Moore, None, Voice pitch as an aid to lipreading Nature. ,vol. 291, pp. 150- 152 ,(1981) , 10.1038/291150A0