Salient Features for Anger Recognition in German and English IVR Portals

Authors: Tim Polzehl, Alexander Schmitt, Florian Metze

DOI: 10.1007/978-1-4419-7934-6_4

Abstract: Anger recognition in speech dialogue systems can help to enhance human-computer interaction. In this chapter we report on the setup and performance optimization techniques for successful anger classification using acoustic cues. We evaluate the performance of a broad variety of features on both a German and an American English voice portal database, which contain "real" (i.e. non-acted) continuous speech of narrow-band quality. Starting with a large-scale feature extraction, we determine optimal sets of feature combinations for each language by applying an Information-Gain based ranking scheme. Analyzing the ranking, we notice that a large proportion of the most promising features for both databases are derived from MFCC and loudness. In contrast to this similarity, pitch features also proved important for the English database. We further calculate classification scores for our setups using discriminative training and Support-Vector Machine classification. The developed systems show [...]

References (29)
Felix Burkhardt, Roman Englert, Markus van Ballegooy, Richard Huber. An Emotion-Aware Voice Portal. (2005)
Fu-Ming Lee, Li-Hua Li, Ru-Yi Huang. Recognizing low/high anger in speech for call centers. International Conference on Signal Processing, pp. 171-176 (2008)
Tim Polzehl, Hamed Ketabdar, Michael Wagner, Florian Metze, Shiva Sundaram. Emotion Classification in Children's speech using fusion of acoustic and linguistic features. Conference of the International Speech Communication Association, pp. 340-343 (2009)
Walter F. Sendlmeier, Astrid Paeschke, Felix Burkhardt, Benjamin Weiss, M. Rolfes. A database of German emotional speech. Conference of the International Speech Communication Association, pp. 1517-1520 (2005)
David G. Stork, Richard O. Duda, Peter E. Hart. Pattern Classification (2nd Edition). Wiley-Interscience (2000)
Steven J. Simske, Xiaofan Lin, John Burns, Sherif M. Yacoub. Recognition of emotions in interactive voice response systems. Conference of the International Speech Communication Association (2003)
Mark A. Hall, Ian H. Witten, Eibe Frank. Data Mining: Practical Machine Learning Tools and Techniques. (1999)
P. Boersma. Praat, a system for doing phonetics by computer. Glot International, vol. 5, pp. 341-345 (2002)
Mark Davies, Joseph L. Fleiss. Measuring Agreement for Multinomial Data. Biometrics, vol. 38, pp. 1047- (1982). DOI: 10.2307/2529886