Multimodal Egocentric Analysis of Focused Interactions

作者: Sophia Bano , Tamas Suveges , Jianguo Zhang , Stephen J. Mckenna

DOI: 10.1109/ACCESS.2018.2850284

关键词:

摘要: Continuous detection of social interactions from wearable sensor data streams has a range potential applications in domains, including health and care, security, assistive technology. We contribute an annotated, multimodal set capturing such using video, audio, GPS, inertial sensing. present methods for automatic temporal segmentation focused support vector machines recurrent neural networks with features extracted both audio video streams. The interaction occurs when the co-present individuals, having mutual focus attention, interact by first establishing face-to-face engagement direct conversation. describe evaluation protocol, framewise, extended event-based measures, provide empirical evidence that fusion visual face track scores voice activity provides effective combination. methods, contributed set, protocol together benchmark future research on this problem. is available at https://doi.org/10.15132/10000134 .

参考文章(41)
Stefano Alletto, Giuseppe Serra, Simone Calderara, Rita Cucchiara, Understanding social relationships in egocentric vision Pattern Recognition. ,vol. 48, pp. 4082- 4096 ,(2015) , 10.1016/J.PATCOG.2015.06.006
M. S. Ryoo, Larry Matthies, First-Person Activity Recognition: Feature, Temporal Structure, and Prediction International Journal of Computer Vision. ,vol. 119, pp. 307- 328 ,(2016) , 10.1007/S11263-015-0847-4
David S. Hayden, Wearable-assisted social interaction as assistive technology for the blind Massachusetts Institute of Technology. ,(2014)
Javier Ramirez, Juan Manuel Górriz, José Carlos Segura, Voice Activity Detection. Fundamentals and Speech Recognition System Robustness InTech. ,(2007) , 10.5772/4740
Davide Anguita, Alessandro Ghio, Luca Oneto, Xavier Parra, Jorge L. Reyes-Ortiz, Human activity recognition on smartphones using a multiclass hardware-friendly support vector machine international workshop on ambient assisted living. pp. 216- 223 ,(2012) , 10.1007/978-3-642-35395-6_30
Datong Chen, Jie Yang, Robert Malkin, Howard D. Wactlar, Detecting social interactions of the elderly in a nursing home environment ACM Transactions on Multimedia Computing, Communications, and Applications. ,vol. 3, pp. 6- ,(2007) , 10.1145/1198302.1198308
S. Coradeschi, A. Cesta, G. Cortellessa, L. Coraci, J. Gonzalez, L. Karlsson, F. Furfari, A. Loutfi, A. Orlandini, F. Palumbo, F. Pecora, S. von Rump, A. Stimec, J. Ullberg, B. Otslund, GiraffPlus: Combining social interaction and long term monitoring for promoting independent living international conference on human system interactions. pp. 578- 585 ,(2013) , 10.1109/HSI.2013.6577883
Xavier Alameda-Pineda, Yan Yan, Elisa Ricci, Oswald Lanz, Nicu Sebe, Analyzing Free-standing Conversational Groups: A Multimodal Approach acm multimedia. pp. 5- 14 ,(2015) , 10.1145/2733373.2806238
Xavier Alameda-Pineda, Vasil Khalidov, Radu Horaud, Florence Forbes, Finding audio-visual events in informal social gatherings international conference on multimodal interfaces. pp. 247- 254 ,(2011) , 10.1145/2070481.2070527
K. G. Manosha Chathuramali, Ranga Rodrigo, Faster human activity recognition with SVM international conference on advances in ict for emerging regions. pp. 197- 203 ,(2012) , 10.1109/ICTER.2012.6421415