Concept Drift Awareness in Twitter Streams

作者: Joana Costa , Catarina Silva , Mario Antunes , Bernardete Ribeiro

DOI: 10.1109/ICMLA.2014.53

关键词:

摘要: Learning in non-stationary environments is not an easy task and requires a distinctive approach. The learning model must only have the ability to continuously learn, but also acquired new concepts forget old ones. Additionally, given significant importance that social networks gained as information networks, there ever-growing interest extraction of complex used for trend detection, promoting services or market sensing. This dynamic nature tends limit performance traditional static models strategies be put forward. In this paper we present strategy learn with drift occurrence Twitter. We propose three different models: time-window model, ensemble-based incremental model. Since little known about types can occur Twitter, simulate by artificially time stamping real Twitter messages order evaluate validate our strategy. Results are so far encouraging regarding presence drift, along classifying streams.

参考文章(30)
Joana Costa, Catarina Silva, Mário Antunes, Bernardete Ribeiro, Defining Semantic Meta-hashtags for Twitter Classification Adaptive and Natural Computing Algorithms. pp. 226- 235 ,(2013) , 10.1007/978-3-642-37213-1_24
Fabian Abel, Qi Gao, Geert-Jan Houben, Ke Tao, Semantic Enrichment of Twitter Posts for User Profile Construction on the Social Web The Semanic Web: Research and Applications. pp. 375- 389 ,(2011) , 10.1007/978-3-642-21064-8_26
Mor Naaman, Hila Becker, Luis Gravano, Beyond Trending Topics: Real-World Event Identification on Twitter international conference on weblogs and social media. ,(2011) , 10.7916/D81V5NVX
Andranik Tumasjan, Isabell M. Welpe, Philipp G. Sandner, Timm Oliver Sprenger, Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment international conference on weblogs and social media. ,(2010)
Philippe Laublet, Milan Stankovic, Matthew Rowe, Mapping tweets to conference talks: a goldmine for semantics The Editors. ,(2010)
Martin Treiber, Daniel Schall, Schahram Dustdar, Christian Scherling, Tweetflows Proceeding of the 3rd international workshop on Principles of engineering service-oriented systems - PESOS '11. pp. 1- 7 ,(2011) , 10.1145/1985394.1985395
Benjamin Doerr, Mahmoud Fouz, Tobias Friedrich, Why rumors spread so quickly in social networks Communications of the ACM. ,vol. 55, pp. 70- 75 ,(2012) , 10.1145/2184319.2184338
Hsia-Ching Chang, A new perspective on Twitter hashtag use: diffusion of innovation theory association for information science and technology. ,vol. 47, pp. 85- ,(2010) , 10.1002/MEET.14504701295