Multimodal fusion of audio, scene, and face features for first impression estimation

作者: Furkan Gurpinar , Heysem Kaya , Albert Ali Salah

DOI: 10.1109/ICPR.2016.7899605

关键词: Affective computingSpeech recognitionComputer visionVisualizationConvolutional neural networkArtificial intelligenceBig Five personality traitsTest setFeature extractionComputer sciencePersonalityFirst impression (psychology)Face (geometry)

摘要: Affective computing, particularly emotion and personality trait recognition, is of increasing interest in many research disciplines. The interplay shows itself the first impression left on other people. Moreover, ambient information, e.g. environment objects surrounding subject, also affect these impressions. In this work, we employ pre-trained Deep Convolutional Neural Networks to extract facial information from images for predicting apparent personality. We investigate Local Gabor Binary Patterns Three Orthogonal Planes video descriptor acoustic features extracted via popularly used openSMILE tool. subsequently propose classifying using a Kernel Extreme Learning Machine fusing their predictions. proposed system applied ChaLearn Challenge First Impression Recognition, achieving winning test set accuracy 0.913, averaged over “Big Five” traits.

参考文章(29)
Petr Motlicek, Samuel Kim, Fabio Valente, Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus conference of the international speech communication association. pp. 1183- 1186 ,(2012)
Laurence Devillers, Björn W. Schuller, Stefan Steidl, Felix Burkhardt, Shrikanth S. Narayanan, Anton Batliner, Christian A. Müller, The INTERSPEECH 2010 Paralinguistic Challenge conference of the international speech communication association. pp. 2794- 2797 ,(2010)
Björn W. Schuller, Stefan Steidl, Anton Batliner, The INTERSPEECH 2009 Emotion Challenge conference of the international speech communication association. pp. 312- 315 ,(2009)
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Sonja Gievska, Kiril Koroveshovski, The Impact of Affective Verbal Content on Predicting Personality Impressions in YouTube Videos Proceedings of the 2014 ACM Multi Media on Workshop on Computational Personality Recognition. pp. 19- 22 ,(2014) , 10.1145/2659522.2659529
Chandrima Sarkar, Sumit Bhatia, Arvind Agarwal, Juan Li, Feature Analysis for Computational Personality Recognition Using YouTube Personality Data set Proceedings of the 2014 ACM Multi Media on Workshop on Computational Personality Recognition. pp. 11- 14 ,(2014) , 10.1145/2659522.2659528
Golnoosh Farnadi, Shanu Sushmita, Geetha Sitaraman, Nhat Ton, Martine De Cock, Sergio Davalos, A Multivariate Regression Approach to Personality Impression Recognition of Vloggers acm multimedia. pp. 1- 6 ,(2014) , 10.1145/2659522.2659526
Firoj Alam, Giuseppe Riccardi, Predicting Personality Traits using Multimodal Information Proceedings of the 2014 ACM Multi Media on Workshop on Computational Personality Recognition. pp. 15- 18 ,(2014) , 10.1145/2659522.2659531
Guang-Bin Huang, Hongming Zhou, Xiaojian Ding, Rui Zhang, Extreme Learning Machine for Regression and Multiclass Classification systems man and cybernetics. ,vol. 42, pp. 513- 529 ,(2012) , 10.1109/TSMCB.2011.2168604
Heysem Kaya, Albert Ali Salah, Continuous Mapping of Personality Traits: A Novel Challenge and Failure Conditions Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop. pp. 17- 24 ,(2014) , 10.1145/2668024.2668025