Comparing measurement errors for formants in synthetic and natural vowels.

作者: Christine H. Shadle , Hosung Nam , D. H. Whalen

DOI: 10.1121/1.4940665

关键词:

摘要: The measurement of formant frequencies vowels is among the most common measurements in speech studies, but are known to be biased by particular fundamental frequency (F0) exciting formants. Approaches reducing errors were assessed two experiments. In first, synthetic constructed with five different first (F1) values and nine F0 values; bandwidths, higher frequencies, constant. Input compared manual automatic measures using linear prediction coding-Burg algorithm, closed-phase covariance, weighted prediction-attenuated main excitation (WLP-AME) algorithm [Alku, Pohjalainen, Vainio, Laukkanen, Story (2013). J. Acoust. Soc. Am. 134(2), 1295-1313], spectra smoothed cepstrally averaging repeated discrete Fourier transforms. Formants also measured manually from pruned reassigned spectrograms (RSs) [Fulop (2011). Speech Spectrum Analysis (Springer, Berlin)]. All WLP-AME RS had large direction strongest harmonic; smallest occur RS. second experiment, these methods used on isolated words spoken four speakers. Results for natural show that bias affects all methods, including WLP-AME; only formants appeared accurate. addition, coped better weaker glottal fry.

参考文章(39)
Sean A. Fulop, The Reassigned Spectrogram Springer, Berlin, Heidelberg. pp. 127- 165 ,(2011) , 10.1007/978-3-642-17478-0_6
Donald G. Childers, Modern Spectrum Analysis ,(1978)
Ingo R. Titze, Ronald J. Baken, Kenneth W. Bozeman, Svante Granqvist, Nathalie Henrich, Christian T. Herbst, David M. Howard, Eric J. Hunter, Dean Kaelin, Raymond D. Kent, Jody Kreiman, Malte Kob, Anders Löfqvist, Scott McCoy, Donald G. Miller, Hubert Noé, Ronald C. Scherer, John R. Smith, Brad H. Story, Jan G. Švec, Sten Ternström, Joe Wolfe, Toward a consensus on symbolic notation of harmonics, resonances, and formants in vocalization Journal of the Acoustical Society of America. ,vol. 137, pp. 3005- 3007 ,(2015) , 10.1121/1.4919349
J. Holmes, Formant excitation before and after glottal closure international conference on acoustics, speech, and signal processing. ,vol. 1, pp. 39- 42 ,(1976) , 10.1109/ICASSP.1976.1170095
Y. PHAM THI NGOC, P. BADIN, Vocal tract acoustic transfer function measurements : further developments and applications Journal De Physique Iv. ,vol. 04, pp. 549- 552 ,(1994) , 10.1051/JP4:19945118
T. Baer, J. C. Gore, L. C. Gracco, P. W. Nye, Analysis of vocal tract shape and dimensions using magnetic resonance imaging: Vowels Journal of the Acoustical Society of America. ,vol. 90, pp. 799- 828 ,(1991) , 10.1121/1.401949
Sean A. Fulop, Accuracy of formant measurement for synthesized vowels using the reassigned spectrogram and comparison with linear prediction. Journal of the Acoustical Society of America. ,vol. 127, pp. 2114- 2117 ,(2010) , 10.1121/1.3308476
R. K. Potter, J. C. Steinberg, Toward the Specification of Speech Journal of the Acoustical Society of America. ,vol. 22, pp. 807- 820 ,(1950) , 10.1121/1.1906694