Speech emotion detection using time dependent self organizing maps

作者: Haythem Balti , Adel S. Elmaghraby

DOI: 10.1109/ISSPIT.2013.6781926

关键词:

摘要: We propose a framework for speech emotion detection that maps acoustic features into high level descriptors integrates time context. Our uses three different algorithms to integrate the temporal The first method is based on averaging of original features. second algorithm derives by clustering data using self-organizing (SOMs) and computing average activity distribution map. third multi resolution window analysis SOMs compute 2-D map emotions trajectories representing behavior Using standard emotional database K-nearest neighbors classifier, we show proposed efficient analysis, visualization classification emotions.

参考文章(33)
David H. Wolpert, Original Contribution: Stacked generalization Neural Networks. ,vol. 5, pp. 241- 259 ,(1992) , 10.1016/S0893-6080(05)80023-1
Andreas Rauber, Rudolf Mayer, Jakob Frank, Analytic Comparison of Audio Feature Sets using Self-Organising Maps ,(2009)
I Hernaez, I Luengo, E Navas, AUTOMATIC EMOTION RECOGNITION USING PROSODIC PARAMETERS conference of the international speech communication association. pp. 493- 496 ,(2005)
Tsang-Long Pao, Yu-Te Chen, Jun-Heng Yeh, Wen-Yuan Liao, Combining Acoustic Features for Improved Emotion Recognition in Mandarin Speech Affective Computing and Intelligent Interaction. pp. 279- 285 ,(2005) , 10.1007/11573548_36
Markus Varsta, Jukka Heikkonen, Jouko Lampinen, José Del R. Millán, Temporal Kohonen Map and the Recurrent Self-Organizing Map: Analytical and Experimental Comparison Neural Processing Letters. ,vol. 13, pp. 237- 251 ,(2001) , 10.1023/A:1011353011837
H.E. Rickard, G.D. Tourassi, A.S. Elmaghraby, Self-organizing maps for masking mammography images ieee international conference on information technology and applications in biomedicine. pp. 302- 305 ,(2003) , 10.1109/ITAB.2003.1222538
Daniel Neiberg, Kjell Elenius, Kornel Laskowski, Emotion Recognition in Spontaneous Speech Using GMMs international conference on spoken language processing. pp. 809- 812 ,(2006)
H. Wakita, Residual energy of linear prediction applied to vowel and speaker recognition IEEE Transactions on Acoustics, Speech, and Signal Processing. ,vol. 24, pp. 270- 271 ,(1976) , 10.1109/TASSP.1976.1162797