Yet Another Approach for the Measurement of the Degree of Voice Normality: A Simple Scheme Based on Feature Reduction and Single Gaussian Distributions

作者: Fay. Ykhlef , W. Benzaba , R. Boutaleb , Jessus B. Alonso , Far. Ykhlef

DOI: 10.1109/ISM.2015.23

关键词: NormalityComputer scienceStatistical modelPattern recognitionGaussianLinear discriminant analysisSet (abstract data type)Artificial intelligenceMel-frequency cepstrumFeature (machine learning)Reduction (complexity)

摘要: In this paper, we propose another approach for the measurement of degree voice normality based on statistical modeling. The basic methodology behind proposed is "Pathological Likelihood Index" reported by Godino-Llorente JI. et al. [1]. major innovations are: exploring a reduced set Mel frequency cepstral coefficients (MFCC) and ignoring their derivatives, linear projection MFCCs into one dimensional space using Fisher's discriminant, and, modeling build around single Gaussian distributions instead mixtures distributions. We have evaluated Massachusetts Eye Ear Infirmary database (MEEI). obtained results are better than in

参考文章(6)
Robert Thayer Sataloff, Diagnosis and Treatment of Voice Disorders ,(2014)
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)
Juan Ignacio Godino-Llorente, Pedro Gómez-Vilda, Fernando Cruz-Roldán, Manuel Blanco-Velasco, Rubén Fraile, Pathological likelihood index as a measurement of the degree of voice normality and perceived hoarseness. Journal of Voice. ,vol. 24, pp. 667- 677 ,(2010) , 10.1016/J.JVOICE.2009.04.003
Julián David Arias-Londoño, Juan I. Godino-Llorente, Maria Markaki, Yannis Stylianou, On combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices Logopedics Phoniatrics Vocology. ,vol. 36, pp. 60- 69 ,(2011) , 10.3109/14015439.2010.528788
S. Davis, P. Mermelstein, Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 28, pp. 65- 74 ,(1980) , 10.1109/TASSP.1980.1163420
J.I. Godino-Llorente, P. Gomez-Vilda, M. Blanco-Velasco, Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters IEEE Transactions on Biomedical Engineering. ,vol. 53, pp. 1943- 1953 ,(2006) , 10.1109/TBME.2006.871883