A model of acoustic interspeaker variability based on the concept of formant-cavity affiliation.

Authors: Lian Apostol, Pascal Perrier, Gérard Bailly

DOI: 10.1121/1.1631946


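Abstract: A method is proposed to model the interspeaker variability of formant patterns for oral vowels. It is assumed that this variability originates in the differences existing among speakers in the respective lengths of their front and back vocal-tract cavities. In order to characterize, from the spectral description of the acoustic speech signal, these vocal-tract differences between speakers, each formant is interpreted, according to the concept of formant-cavity affiliation, as a resonance of a specific vocal-tract cavity. Its frequency can thus be directly related to the corresponding cavity length, and a transformation model can be proposed from a speaker A to a speaker B on the basis of the frequency ratios of the formants corresponding to the same resonances. In order to minimize the number of sounds to be recorded for each speaker to carry out this transformation, the frequency ratios are exactly computed only for the three extreme cardinal vowels [i, a, u], and they are approximated for the remaining vowels through an interpolation function. The method is evaluated through its capacity to transform the (F1,F2) formant patterns of eight oral vowels pronounced by five male speakers into the (F1,F2) patterns of the corresponding vowels generated by an articulatory model of the vocal tract. The resulting formant patterns are compared to those provided by normalization techniques published in the literature. The proposed method is found to be efficient, but a number of limitations are also observed and discussed. These limitations can be associated with the formant-cavity affiliation concept itself or with a possible influence of speaker-specific vocal-tract geometry in the cross-sectional direction, which the model might not have taken into account.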
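The ratio-based transformation outlined in the abstract can be illustrated with a minimal Python sketch. It rests on several assumptions that do not come from the paper: the abstract does not specify the interpolation function, so inverse-distance weighting in the (F1,F2) plane is swapped in purely for illustration; formant-cavity affiliation is not modeled, so ratios are taken per formant index rather than per cavity resonance; and the function names and formant values are hypothetical.

```python
import math

CARDINALS = ("i", "a", "u")  # the three extreme cardinal vowels


def cardinal_ratios(src, tgt):
    """Per-formant frequency ratios (target / source) for each cardinal vowel.

    Simplification: ratios are taken per formant index; the paper instead
    pairs formants that share the same cavity resonance.
    """
    return {v: tuple(t / s for s, t in zip(src[v], tgt[v])) for v in CARDINALS}


def interpolated_ratios(f, src, ratios, eps=1e-9):
    """Blend the cardinal ratios for an arbitrary vowel f = (F1, F2).

    Assumption: inverse-distance weights in (F1, F2) space stand in for the
    paper's interpolation function, which the abstract does not describe.
    """
    weights = {}
    for v in CARDINALS:
        d = math.dist(f, src[v])
        if d < eps:  # f coincides with a cardinal vowel: use its exact ratios
            return ratios[v]
        weights[v] = 1.0 / d
    total = sum(weights.values())
    return tuple(
        sum(weights[v] * ratios[v][k] for v in CARDINALS) / total
        for k in range(len(f))
    )


def transform(f, src, tgt):
    """Map formants f of the source speaker toward the target speaker."""
    r = interpolated_ratios(f, src, cardinal_ratios(src, tgt))
    return tuple(fk * rk for fk, rk in zip(f, r))


# Hypothetical (F1, F2) values in Hz, for illustration only.
speaker_a = {"i": (280.0, 2200.0), "a": (700.0, 1300.0), "u": (300.0, 800.0)}
speaker_b = {"i": (310.0, 2500.0), "a": (800.0, 1400.0), "u": (320.0, 850.0)}
print(transform((500.0, 1500.0), speaker_a, speaker_b))
```

With this weighting, a cardinal vowel of the source speaker maps exactly onto the corresponding vowel of the target speaker, while every other vowel receives a blend of the three cardinal ratios.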

