A model of acoustic interspeaker variability based on the concept of formant-cavity affiliation.

作者: Lian Apostol , Pascal Perrier , Gérard Bailly

DOI: 10.1121/1.1631946

关键词:

摘要: A method is proposed to model the interspeaker variability of formant patterns for oral vowels. It assumed that this originates in differences existing among speakers respective lengths their front and back vocal-tract cavities. In order characterize, from spectral description acoustic speech signal, these between speakers, each interpreted, according concept formant-cavity affiliation, as a resonance specific cavity. Its frequency can thus be directly related corresponding cavity length, transformation speaker B on basis ratios formants same resonances. minimize number sounds recorded carry out transformation, are exactly computed only three extreme cardinal vowels [i, a, u] they approximated remaining through an interpolation function. The evaluated its capacity transform (F1,F2) eight pronounced by five male into generated articulatory vocal tract. resulting compared those provided normalization techniques published literature. found efficient, but limitations also observed discussed. These associated with affiliation itself or possible influence speaker-specific geometry cross-sectional direction, which might not have taken account.

参考文章(23)
Maria-Gabriella Di Benedetto, Jean-Sylvain Liénard, Extrinsic normalization of vowel formant values based on cardinal vowels mapping. conference of the international speech communication association. ,(1992)
Gérard Bailly, Resonances as possible representation of speech in the auditory-to-articulatory transform. conference of the international speech communication association. ,(1993)
Terrance Michael Nearey, Phonetic feature systems for vowels ,(1978)
Christophe Savariaux, Pascal Perrier, Jean Pierre Orliaguet, Compensation strategies for the perturbation of the rounded vowel [u] using a lip tube: A study of the control space in speech production Journal of the Acoustical Society of America. ,vol. 98, pp. 2428- 2442 ,(1995) , 10.1121/1.413277
Sandra Ferrari Disner, Evaluation of vowel normalization procedures Journal of the Acoustical Society of America. ,vol. 67, pp. 253- 261 ,(1980) , 10.1121/1.383734
Ingo Titze, Darrell Wong, Brad Story, Russell Long, Considerations in voice transformation with physiologic scaling principles Speech Communication. ,vol. 22, pp. 113- 123 ,(1997) , 10.1016/S0167-6393(97)00014-9
Kenneth N. Stevens, Acoustic correlates of some phonetic categories. Journal of the Acoustical Society of America. ,vol. 68, pp. 836- 842 ,(1979) , 10.1121/1.384823
Boris M Lobanov, None, Classification of Russian Vowels Spoken by Different Speakers The Journal of the Acoustical Society of America. ,vol. 49, pp. 606- 608 ,(1971) , 10.1121/1.1912396