Considerations in voice transformation with physiologic scaling principles

Authors: Ingo Titze, Darrell Wong, Brad Story, Russell Long

Abstract: This study begins to explore the importance of the physiological domain in voice transformation. A general approach is outlined for transforming the voice quality of sentence-level speech while maintaining the same phonetic content. Transformations will eventually include gender, age, voice quality, emotional state, disordered state, dialect, or impersonation. In this paper, only a specific voice quality, twang, is described as an example. The basic question is: relative to pure signal processing, can voices be transformed more effectively if biomechanical, acoustic, and anatomical scaling principles are applied? At present, two approaches are contrasted: a Linear Predictive Coding approach and a biomechanical simulation approach.

