Characterization of Dysphonic Voices by Means of a Filterbank-Based Spectral Analysis: Sustained Vowels and Running Speech

作者: Rubén Fraile , Juan Ignacio Godino-Llorente , Nicolás Sáenz-Lechón , Víctor Osma-Ruiz , Juana María Gutiérrez-Arriola

DOI: 10.1016/J.JVOICE.2012.07.004

关键词:

摘要: Summary Objectives This article presents a comparative study of the spectral power distribution for normal and dysphonic voices, both sustained vowels running speech. The objective this was to find robust cues dysphonia in domain. For purpose, recordings from two databases are processed, one them including Additionally, new measure stability is introduced (decorrelation time). application spectrum also tested as cue dysphonia. Materials Methods analysis done having an auditory model filterbank approach references computation discrete spectrograms. Results obtained three sets belonging different databases. reported results indicate that only minor differences exist shape voices when performing vowel phonation tasks. However, calculated band decorrelation times bands between 2000 6400Hz significantly less stable voices. As speech, not such good indicator dysphonia, but there significant difference level high-frequency (above 5300Hz). In addition, means sampling rates above 10.6ksps needed assessing speech Also, involving short-time analysis, frame 100 frames/s should be preferred.

参考文章(45)
Dimitar D. Deliyski, Acoustic model and evaluation of pathological voice production. conference of the international speech communication association. ,(1993)
John R. Buck, Alan V. Oppenheim, Ronald W. Schafer, Discrete-time signal processing (2nd ed.) Prentice-Hall, Inc.. ,(1999)
John G. Proakis, John R. Deller, John H. Hansen, Discrete-Time Processing of Speech Signals ,(1993)
Alan V. Oppenheim, Ronald W. Schafer, Discrete-Time Signal Processing ,(1989)
Christopher R. Watts, Shaheen N. Awan, Use of spectral/cepstral analyses for differentiating normal from hypofunctional voices in sustained vowel and continuous speech contexts. Journal of Speech Language and Hearing Research. ,vol. 54, pp. 1525- 1537 ,(2011) , 10.1044/1092-4388(2011/10-0209)
Linda Lee, Joseph C. Stemple, Leslie Glaze, Lisa N. Kelchner, Quick Screen for Voice and Supplementary Documents for Identifying Pediatric Voice Disorders Language Speech and Hearing Services in Schools. ,vol. 35, pp. 308- 319 ,(2004) , 10.1044/0161-1461(2004/030
P. H. Dejonckere, Patrick Bradley, Pais Clemente, Guy Cornut, Lise Crevier-Buchman, Gerhard Friedrich, Paul Van De Heyning, Marc Remacle, Virginie Woisard, A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques. Guideline elaborated by the Committee on Phoniatrics of the European Laryngological Society (ELS). European Archives of Oto-rhino-laryngology. ,vol. 258, pp. 77- 82 ,(2001) , 10.1007/S004050000299
Soren Y. Lowell, Raymond H. Colton, Richard T. Kelley, Youngmee C. Hahn, Spectral- and cepstral-based measures during continuous speech: Capacity to distinguish dysphonia and consistency within a speaker Journal of Voice. ,vol. 25, ,(2010) , 10.1016/J.JVOICE.2010.06.007
David M. Howard, Evelyn Abberton, Adrian Fourcin, Disordered voice measurement and auditory analysis Speech Communication. ,vol. 54, pp. 611- 621 ,(2012) , 10.1016/J.SPECOM.2011.03.008