Cross-Lingual Classification of Crisis Data

作者: Prashant Khare , Grégoire Burel , Diana Maynard , Harith Alani

DOI: 10.1007/978-3-030-00671-6_36

关键词:

摘要: Many citizens nowadays flock to social media during crises share or acquire the latest information about event. Due sheer volume of data typically circulated such events, it is necessary be able efficiently filter out irrelevant posts, thus focusing attention on posts that are truly relevant crisis. Current methods for classifying relevance a crisis set struggle deal with in different languages, and not viable rapidly evolving situations train new models each language. In this paper we test statistical semantic classification approaches cross-lingual datasets from 30 consisting written mainly English, Spanish, Italian. We experiment scenarios where model trained one language tested another, translated single show addition features extracted external knowledge bases improve accuracy over purely model.

参考文章(24)
Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz, Patrick Meier, Practical extraction of disaster-relevant information from social media the web conference. pp. 1021- 1024 ,(2013) , 10.1145/2487788.2488109
Robert Power, Bella Robinson, John Colton, Mark Cameron, Emergency Situation Awareness: Twitter Case Studies Lecture Notes in Business Information Processing. pp. 218- 231 ,(2014) , 10.1007/978-3-319-11818-5_19
Sarvnaz Karimi, Jie Yin, Cecile Paris, Classifying microblogs for disasters australasian document computing symposium. pp. 26- 33 ,(2013) , 10.1145/2537734.2537737
J. Rogstadius, M. Vukovic, C. A. Teixeira, V. Kostakos, E. Karapanos, J. A. Laredo, CrisisTracker: crowdsourced social media curation for disaster awareness Journal of Reproduction and Development. ,vol. 57, pp. 1- ,(2013) , 10.1147/JRD.2013.2260692
Rui Li, Kin Hou Lei, Ravi Khadiwala, Kevin Chen-Chuan Chang, TEDAS: A Twitter-based Event Detection and Analysis System 2012 IEEE 28th International Conference on Data Engineering. pp. 1273- 1276 ,(2012) , 10.1109/ICDE.2012.125
Sarah Vieweg, Amanda L. Hughes, Kate Starbird, Leysia Palen, Microblogging during two natural hazards events: what twitter may contribute to situational awareness human factors in computing systems. pp. 1079- 1088 ,(2010) , 10.1145/1753326.1753486
Roberto Navigli, Simone Paolo Ponzetto, BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network Artificial Intelligence. ,vol. 193, pp. 217- 250 ,(2012) , 10.1016/J.ARTINT.2012.07.001
Takeshi Sakaki, Makoto Okazaki, Yutaka Matsuo, Earthquake shakes Twitter users: real-time event detection by social sensors the web conference. pp. 851- 860 ,(2010) , 10.1145/1772690.1772777
Carmen Banea, Rada Mihalcea, Janyce Wiebe, Learning Multilingual Subjective Language via Cross-Lingual Projections meeting of the association for computational linguistics. pp. 976- 983 ,(2007)