Comparison of Classification Methods for Detecting Emotion from Mandarin Speech

作者: T.-L. PAO , Y.-T. CHEN , J.-H. YEH

DOI: 10.1093/IETISY/E91-D.4.1074

关键词:

摘要: It is said that technology comes out from humanity. What humanity? The very definition of humanity emotion. Emotion the basis for all human expression and underlying theme behind everything done, said, thought or imagined. Making computers being able to perceive respond emotion, human-computer interaction will be more natural. Several classifiers are adopted automatically assigning an emotion category, such as anger, happiness sadness, a speech utterance. These were designed independently tested on various emotional corpora, making it difficult compare evaluate their performance. In this paper, we first compared several popular classification methods evaluated performance by applying them Mandarin corpus consisting five basic emotions, including happiness, boredom, sadness neutral. extracted feature streams contain MFCC, LPCC, LPC. experimental results show proposed WD-MKNN classifier achieves accuracy 81.4% 5-class recognition outperforms other techniques, KNN, MKNN, DW-KNN, LDA, QDA, GMM, HMM, SVM, BPNN. Then, verify advantage method, these another expressive two emotions. still others.

参考文章(23)
Tin Lay Nwe, Say Wei Foo, Liyanage C De Silva, None, Stress Classification Using Subband Based Features IEICE Transactions on Information and Systems. ,vol. 86, pp. 565- 573 ,(2003)
Tsang-Long Pao, Yu-Te Chen, Jun-Heng Yeh, Yun-Maw Cheng, Yu-Yuan Lin, A comparative study of different weighting schemes on KNN-based emotion recognition in mandarin speech international conference on intelligent computing. pp. 997- 1005 ,(2007) , 10.1007/978-3-540-74171-8_101
Valery A. Petrushin, EMOTION RECOGNITION IN SPEECH SIGNAL: EXPERIMENTAL STUDY, DEVELOPMENT, AND APPLICATION conference of the international speech communication association. pp. 222- 225 ,(2000)
Philippe Zimmermann, Sissel Guttormsen, Brigitta Danuser, Patrick Gomez, Affective computing--a rationale for measuring mood with mouse and keyboard. International Journal of Occupational Safety and Ergonomics. ,vol. 9, pp. 539- 551 ,(2003) , 10.1080/10803548.2003.11076589
Roberto Pieraccini, Chul Min Lee, Shrikanth S. Narayanan, COMBINING ACOUSTIC AND LANGUAGE INFORMATION FOR EMOTION RECOGNITION conference of the international speech communication association. pp. 873- 876 ,(2002)
Tsang-Long Pao, Yu-Te Chen, Jun-Heng Yeh, Wen-Yuan Liao, Combining Acoustic Features for Improved Emotion Recognition in Mandarin Speech Affective Computing and Intelligent Interaction. pp. 279- 285 ,(2005) , 10.1007/11573548_36
Xuan Hung Le, G. Quenot, E. Castelli, Recognizing emotions for the audio-visual document indexing international symposium on computers and communications. ,vol. 2, pp. 580- 584 ,(2004) , 10.1109/ISCC.2004.1358600
Paul Ekman, An argument for basic emotions Cognition & Emotion. ,vol. 6, pp. 169- 200 ,(1992) , 10.1080/02699939208411068
K Chan, J Hao, O W Kwon, EMOTION RECOGNITION BY SPEECH SIGNAL conference of the international speech communication association. pp. 125- 128 ,(2003)
Donn Morrison, Ruili Wang, Liyanage C De Silva, None, Spoken affect classification using neural networks granular computing. ,vol. 2, pp. 583- 586 ,(2005) , 10.1109/GRC.2005.1547359