Comparison of spectral and prosodic parameters of male and female emotional speech in Czech and Slovak

Authors: J. Pribil, A. Pribilova

DOI: 10.1109/ICASSP.2011.5947409

Keywords: Natural language processing, Slovak, Formant, Artificial intelligence, Cepstrum, Computer science, Speech processing, Speech production, Spectral flatness, Jitter, Speech recognition, Czech

Abstract: This paper analyzes and compares spectral properties (first three formant positions and spectral flatness measure values) and prosodic parameters (F0, energy, microintonation, jitter) of male and female acted emotional speech in the Czech and Slovak languages. The statistical results and the values of the parameter ratios will be used to modify a text-to-speech (TTS) system based on cepstral description, enabling expressive speech production with male and female voices.
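Two of the parameters named in the abstract have compact textbook definitions, sketched below in Python with NumPy. This is only an illustration under stated assumptions, not the authors' implementation: the function names `spectral_flatness` and `local_jitter` are hypothetical, the flatness is taken as the ratio of the geometric to the arithmetic mean of a frame's power spectrum, and the jitter is taken as the mean absolute difference of consecutive pitch periods divided by the mean period. The paper may use different variants, framing, or windowing.

```python
import numpy as np


def spectral_flatness(frame, eps=1e-12):
    """Geometric mean over arithmetic mean of the power spectrum.

    Values near 0 indicate a tonal (peaky) spectrum, values closer to 1
    a noise-like (flat) one. `eps` is an assumed floor that avoids log(0).
    """
    power = np.abs(np.fft.rfft(frame)) ** 2 + eps
    geometric_mean = np.exp(np.mean(np.log(power)))
    return float(geometric_mean / np.mean(power))


def local_jitter(periods):
    """Mean absolute difference of consecutive pitch periods,
    normalised by the mean period (assumed 'local jitter' variant)."""
    periods = np.asarray(periods, dtype=float)
    return float(np.mean(np.abs(np.diff(periods))) / np.mean(periods))


if __name__ == "__main__":
    n = np.arange(256)
    tone = np.sin(2 * np.pi * 10 * n / 256)           # synthetic pure tone
    noise = np.random.default_rng(0).standard_normal(1024)  # synthetic noise
    print("flatness (tone): ", spectral_flatness(tone))
    print("flatness (noise):", spectral_flatness(noise))
    # Hypothetical pitch periods in seconds, alternating 10 ms and 11 ms.
    print("jitter:", local_jitter([0.010, 0.011, 0.010, 0.011]))
```

The input signals and pitch periods above are synthetic and serve only to exercise the functions; in practice the periods would come from a pitch-marking step applied to voiced speech.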

References (13)
Pascual Ejarque, Mireia Farrús, Javier Hernando, Jitter and shimmer measurements for speaker recognition. Conference of the International Speech Communication Association, pp. 778-781 (2007)
Jiří Přibil, Anna Přibilová, Application of Expressive Speech in TTS System with Cepstral Description. Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction, pp. 200-212 (2008), 10.1007/978-3-540-70872-8_15
Gunnar Fant, Speech acoustics and phonetics. Kluwer Academic (2004)
Alan V. Oppenheim, Ronald W. Schafer, Discrete-Time Signal Processing (1989)
P. Boersma, Praat, a system for doing phonetics by computer. Glot International, vol. 5, pp. 341-345 (2002)
K. Scherer, Vocal communication of emotion: A review of research paradigms. Speech Communication, vol. 40, pp. 227-256 (2003), 10.1016/S0167-6393(02)00084-5
Xi Li, Jidong Tao, Michael T. Johnson, Joseph Soltis, Anne Savage, Kirsten M. Leong, John D. Newman, Stress and Emotion Classification using Jitter and Shimmer Features. International Conference on Acoustics, Speech, and Signal Processing, vol. 4, pp. 1081-1084 (2007), 10.1109/ICASSP.2007.367261
Anna Přibilová, Jiří Přibil, Non-linear frequency scale mapping for voice conversion in text-to-speech system with cepstral description. Non Linear Speech Processing, vol. 48, pp. 1691-1703 (2006), 10.1016/J.SPECOM.2006.08.001
A. Gray, J. Markel, A spectral-flatness measure for studying the autocorrelation method of linear prediction of speech analysis. IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 22, pp. 207-217 (1974), 10.1109/TASSP.1974.1162572
Ignasi Iriondo, Santiago Planet, Joan-Claudi Socoró, Elisa Martínez, Francesc Alías, Carlos Monzo, Automatic refinement of an expressive speech corpus assembling subjective perception and automatic classification. Non Linear Speech Processing, vol. 51, pp. 744-758 (2009), 10.1016/J.SPECOM.2008.12.001