The Vera am Mittag German audio-visual emotional speech database

作者: Michael Grimm , Kristian Kroschel , Shrikanth Narayanan

DOI: 10.1109/ICME.2008.4607572

关键词:

摘要: The lack of publicly available annotated databases is one the major barriers to research advances on emotional information processing. In this contribution we present a recently collected database spontaneous speech in German which being made community. consists 12 hours audio-visual recordings TV talk show ldquoVera am Mittagrdquo, segmented into broadcasts, dialogue acts and utterances. This corpus contains very recorded from unscripted, authentic discussions between guests show. addition data utterances provide emotion labels for great part data. are given continuous valued scale three primitives: valence, activation dominance, using large number human evaluators. Such interest all groups working analysis, recognition both facial expression, natural language understanding, robust recognition.

参考文章(9)
Ellen Douglas-Cowie, Roddy Cowie, Marc Schröder, A new emotion database: considerations, sources and scope Speech and Emotion: Proceedings of the ISCA workshop. ,(2000)
Paul Ekman, Universals and cultural differences in facial expressions of emotion. Nebraska symposium on motivation, 1971. ,vol. 1971, pp. 207- 282 ,(1972)
Michael Grimm, Kristian Kroschel, Shrikanth Narayanan, Support Vector Regression for Automatic Recognition of Spontaneous Emotions in Speech international conference on acoustics, speech, and signal processing. ,vol. 4, pp. 1085- 1088 ,(2007) , 10.1109/ICASSP.2007.367262
Ellen Douglas-Cowie, Roddy Cowie, Ian Sneddon, Cate Cox, Orla Lowry, Margaret McRorie, Jean-Claude Martin, Laurence Devillers, Sarkis Abrilian, Anton Batliner, Noam Amir, Kostas Karpouzis, The HUMAINE Database: Addressing the Collection and Annotation of Naturalistic and Induced Emotional Data affective computing and intelligent interaction. pp. 488- 500 ,(2007) , 10.1007/978-3-540-74889-2_43
M. Grimm, K. Kroschel, Evaluation of natural emotions using self assessment manikins ieee automatic speech recognition and understanding workshop. pp. 381- 385 ,(2005) , 10.1109/ASRU.2005.1566530
Ellen Douglas-Cowie, Nick Campbell, Roddy Cowie, Peter Roach, Emotional speech: Towards a new generation of databases Speech Communication. ,vol. 40, pp. 33- 60 ,(2003) , 10.1016/S0167-6393(02)00070-5
Lijun Yin, Xiaozhou Wei, Yi Sun, Jun Wang, M.J. Rosato, A 3D facial expression database for facial behavior research international conference on automatic face and gesture recognition. pp. 211- 216 ,(2006) , 10.1109/FGR.2006.6
Dimitrios Ververidis, Constantine Kotropoulos, Emotional speech recognition: Resources, features, and methods Speech Communication. ,vol. 48, pp. 1162- 1181 ,(2006) , 10.1016/J.SPECOM.2006.04.003
Michael Grimm, Kristian Kroschel, Emily Mower, Shrikanth Narayanan, Primitives-based evaluation and estimation of emotions in speech Speech Communication. ,vol. 49, pp. 787- 800 ,(2007) , 10.1016/J.SPECOM.2007.01.010