On-line Emotion Recognition in a 3-D Activation-Valence-Time Continuum using Acoustic and Linguistic Cues

Authors: Florian Eyben, Martin Wöllmer, Alex Graves, Björn Schuller, Ellen Douglas-Cowie

DOI: 10.1007/S12193-009-0032-6

Keywords: Computer science; Recurrent neural nets; Long short term memory; Time delay neural network; Linguistics; Emotion recognition; Speech recognition; Intelligent character recognition; Recurrent neural network; Speaker recognition

Abstract: For many applications of emotion recognition, such as virtual agents, the system must select responses while the user is speaking. This requires reliable on-line recognition of the user’s affect. However, most emotion recognition systems are based on turnwise processing. We present a novel approach to on-line emotion recognition from speech using Long Short-Term Memory Recurrent Neural Networks. Emotion is recognised frame-wise in a two-dimensional valence-activation continuum. In contrast to current state-of-the-art approaches, recognition is performed on low-level signal frames, similar to those used for speech recognition. No statistical functionals are applied to low-level feature contours. Framing at a higher level is therefore unnecessary, and regression outputs can be produced in real-time for every low-level input frame. We also investigate the benefits of including linguistic features on the signal frame level obtained by a keyword spotter.
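
The frame-wise idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the weights are random and untrained, the feature dimension and hidden size are hypothetical, and the tanh read-out that keeps both outputs in [-1, 1] is an assumption made only for this example. The point it shows is structural: a standard LSTM cell consumes one low-level frame at a time and emits a (valence, activation) estimate for that frame immediately, without computing statistical functionals over a whole turn.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def affine(W, b, x):
    """Compute W @ x + b for plain Python lists."""
    return [sum(w * v for w, v in zip(row, x)) + bi for row, bi in zip(W, b)]

def make_params(n_in, n_hidden, n_out, seed=0):
    """Random, untrained weights (hypothetical; a real system would train them)."""
    rng = random.Random(seed)
    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    # One (weights, bias) pair per gate: input, forget, output, cell candidate.
    gates = {g: (mat(n_hidden, n_in + n_hidden), [0.0] * n_hidden) for g in "ifoc"}
    readout = (mat(n_out, n_hidden), [0.0] * n_out)
    return gates, readout

def lstm_step(gates, x, h, c):
    """One standard LSTM update for a single input frame x."""
    z = x + h  # concatenate the current frame with the previous hidden state
    i = [sigmoid(v) for v in affine(*gates["i"], z)]
    f = [sigmoid(v) for v in affine(*gates["f"], z)]
    o = [sigmoid(v) for v in affine(*gates["o"], z)]
    g = [math.tanh(v) for v in affine(*gates["c"], z)]
    c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
    h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
    return h_new, c_new

def framewise_regression(frames, gates, readout, n_hidden):
    """Return one (valence, activation) estimate per input frame."""
    h = [0.0] * n_hidden
    c = [0.0] * n_hidden
    outputs = []
    for x in frames:
        h, c = lstm_step(gates, x, h, c)
        # Assumed read-out: tanh keeps both dimensions in [-1, 1].
        valence, activation = (math.tanh(v) for v in affine(*readout, h))
        outputs.append((valence, activation))
    return outputs

# Three hypothetical 3-dimensional low-level feature frames.
gates, readout = make_params(n_in=3, n_hidden=4, n_out=2)
frames = [[0.1, 0.2, 0.3], [0.0, -0.1, 0.4], [0.5, 0.5, -0.2]]
preds = framewise_regression(frames, gates, readout, n_hidden=4)
```

Because the state is carried forward frame by frame, the estimate for a given frame depends only on that frame and earlier ones, which is what makes output during speech possible. Under the same assumptions, a frame-level linguistic feature such as a keyword-spotter output could be appended to each frame vector by increasing `n_in`.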
