Characteristics of human auditory model based on compensation of glottal features in speech emotion recognition

作者: Sun Ying , Zhang Xue-Ying

DOI: 10.1016/J.FUTURE.2017.10.002

关键词:

摘要: Abstract The speech signal carries emotional message during its production. With the analysis on relation between sound production and glottis, paper has introduced glottal features into emotion recognition, proposed model where glottis is used for compensation of features, extracted feature Glottal Compensation to Zero Crossings with Maximal Teager Energy Operator (GCZCMT). Two experiments have been designed, including that: firstly, single databases TYUT Berlin are respectively experiment (the purpose such research recognition capability GCZCMT feature, experimental results show that a possibly effectively distinguishing state); secondly, this one mixing database ross-database language, dependency minimum, more suitable actual complex language environment, higher practical value.

参考文章(15)
Rui Sun, Elliot Moore, Investigating glottal parameters and teager energy operators in emotion recognition affective computing and intelligent interaction. pp. 425- 434 ,(2011) , 10.1007/978-3-642-24571-8_54
Sylvie J. L. Mozziconacci, Modeling Emotion and Attitude in Speech by Means of Perceptually Based Parameter Values User Modeling and User-adapted Interaction. ,vol. 11, pp. 297- 326 ,(2001) , 10.1023/A:1011800417621
Krishna Mohan Kudiri, Abas Md Said, M Yunus Nayan, Emotion detection using relative amplitude-based features through speech international conference on computer and information science. ,vol. 1, pp. 522- 525 ,(2012) , 10.1109/ICCISCI.2012.6297301
Shashidhar G. Koolagudi, K. Sreenivasa Rao, Emotion recognition from speech using source, system, and prosodic features International Journal of Speech Technology. ,vol. 15, pp. 265- 289 ,(2012) , 10.1007/S10772-012-9139-3
Alexander I. Iliev, Michael S. Scordilis, Spoken emotion recognition using glottal symmetry EURASIP Journal on Advances in Signal Processing. ,vol. 2011, pp. 624575- ,(2011) , 10.1155/2011/624575
Jouni Pohjalainen, Tuomo Raitio, Santeri Yrttiaho, Paavo Alku, Detection of shouted speech in noise: Human and machine Journal of the Acoustical Society of America. ,vol. 133, pp. 2377- 2389 ,(2013) , 10.1121/1.4794394
Alexander I Iliev, Michael S Scordilis, Joao P Papa, Alexandre X Falcao, None, Spoken emotion recognition through optimum-path forest classification using glottal features Computer Speech & Language. ,vol. 24, pp. 445- 460 ,(2010) , 10.1016/J.CSL.2009.02.005
Hariharan Muthusamy, Kemal Polat, Sazali Yaacob, Improved Emotion Recognition Using Gaussian Mixture Model and Extreme Learning Machine in Speech and Glottal Signals Mathematical Problems in Engineering. ,vol. 2015, pp. 1- 13 ,(2015) , 10.1155/2015/394083
Ling He, Margaret Lech, Jing Zhang, Xiaomei Ren, Lihua Deng, Study of wavelet packet energy entropy for emotion classification in speech and glottal signals international conference on digital image processing. ,vol. 8878, pp. 887834- ,(2013) , 10.1117/12.2030929
Xiangming Kong, George C. Valley, Roy Matic, Error analysis and implementation considerations of decoding algorithms for time-encoding machine EURASIP Journal on Advances in Signal Processing. ,vol. 2011, pp. 1- 9 ,(2011) , 10.1186/1687-6180-2011-1