A psychoacoustic method to find the perceptual cues of stop consonants in natural speech.

Authors: Feipeng Li, Anjali Menon, Jont B. Allen

DOI: 10.1121/1.3295689

Abstract: Synthetic speech has been widely used in the study of speech cues. A serious disadvantage of this method is that it requires prior knowledge about the cues to be identified in order to synthesize the speech. Incomplete or inaccurate hypotheses about the cues often lead to speech sounds of low quality. In this research a psychoacoustic method, named three-dimensional deep search (3DDS), is developed to explore the perceptual cues of stop consonants from naturally produced speech. For a given sound, it measures the contribution of each subcomponent to perception by time truncating, highpass/lowpass filtering, or masking the speech with white noise. The AI-gram, a visualization tool that simulates the auditory peripheral processing, is used to predict the audible components of the speech sound. The results are generally in agreement with the classical studies that stops are characterized by a short duration burst followed by an F2 transition, suggesting the effectiveness of the 3DDS method. However, it is also shown that /ba/ and /pa/ may have a wide band click as the dominant cue. The F2 transition is not necessary for the perception of /ta/ and /ka/. Moreover, many stop consonants contain conflicting cues that are characteristic of competing sounds. The robustness of a consonant sound to noise is determined by the intensity of the dominant cue.
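The three signal manipulations named in the abstract (time truncation, highpass/lowpass filtering, and white-noise masking) can be sketched as follows. This is a minimal illustration and not the authors' procedure: the function names are hypothetical, the brick-wall FFT filter is a stand-in for whatever filters the study used, the test signal is a synthetic two-tone mixture rather than speech, and the listening experiments that 3DDS relies on are not represented.

```python
import numpy as np


def truncate_onset(x, fs, t_start):
    """Time truncation: discard everything before t_start seconds."""
    return x[int(round(t_start * fs)):]


def brickwall_filter(x, fs, cutoff_hz, kind):
    """Idealized lowpass/highpass by zeroing FFT bins (illustrative stand-in)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    if kind == "lowpass":
        spectrum[freqs > cutoff_hz] = 0.0
    elif kind == "highpass":
        spectrum[freqs < cutoff_hz] = 0.0
    else:
        raise ValueError("kind must be 'lowpass' or 'highpass'")
    return np.fft.irfft(spectrum, n=len(x))


def add_white_noise(x, snr_db, rng):
    """Mask the signal with white Gaussian noise at a wideband SNR in dB."""
    signal_power = np.mean(x ** 2)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=x.shape)
    return x + noise


if __name__ == "__main__":
    fs = 16000                          # assumed sampling rate in Hz
    t = np.arange(fs) / fs              # one second of samples
    # Synthetic placeholder for a speech token: a low and a high component.
    x = np.sin(2 * np.pi * 500 * t) + np.sin(2 * np.pi * 3000 * t)

    rng = np.random.default_rng(0)
    conditions = {
        "truncated at 0.25 s": truncate_onset(x, fs, 0.25),
        "lowpass 1 kHz": brickwall_filter(x, fs, 1000.0, "lowpass"),
        "highpass 1 kHz": brickwall_filter(x, fs, 1000.0, "highpass"),
        "white noise, 0 dB SNR": add_white_noise(x, 0.0, rng),
    }
    for name, y in conditions.items():
        print(name, len(y), float(np.mean(y ** 2)))
```

In an experiment of this kind, each modified token would be played to listeners, and the truncation time, cutoff frequency, or SNR at which identification changes would point to the time, frequency, and intensity of the cue.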
