Performance Metrics for Model Fusion in Twitter Data Drifts

作者: Joana Costa , Catarina Silva , Mário Antunes , Bernardete Ribeiro

DOI: 10.1007/978-3-319-58838-4_2

关键词: Concept driftData miningWork (electrical)Diversity (business)Computer science

摘要: Ensemble approaches have revealed remarkable abilities to tackle different learning challenges, namely in dynamic scenarios with concept drift, e.g. social networks, as Twitter. Several efforts been engaged defining strategies combine the models that constitute an ensemble. In this work, we investigate effect of using metrics for combining ensembles’ models, specifically performance-based metrics. We propose five performance metrics, having mind may take advantage diversity classifiers, their individual takes a leading role contribution Experimental results on Twitter dataset, artificially timestamped, suggest ensemble can introduce relevant improvements overall performance.

参考文章(22)
Joana Costa, Catarina Silva, Mário Antunes, Bernardete Ribeiro, Defining Semantic Meta-hashtags for Twitter Classification Adaptive and Natural Computing Algorithms. pp. 226- 235 ,(2013) , 10.1007/978-3-642-37213-1_24
B. Scassellati, C. Crick, K. Gold, E. Kim, F. Shic, Ganghua Sun, Social development [robots] IEEE Computational Intelligence Magazine. ,vol. 1, pp. 41- 47 ,(2006) , 10.1109/MCI.2006.1672987
Joana Costa, Catarina Silva, Mario Antunes, Bernardete Ribeiro, The impact of longstanding messages in micro-blogging classification international joint conference on neural network. pp. 1- 8 ,(2015) , 10.1109/IJCNN.2015.7280731
Joana Costa, Catarina Silva, Mario Antunes, Bernardete Ribeiro, Concept Drift Awareness in Twitter Streams international conference on machine learning and applications. pp. 294- 299 ,(2014) , 10.1109/ICMLA.2014.53
Hsia-Ching Chang, A new perspective on Twitter hashtag use: diffusion of innovation theory association for information science and technology. ,vol. 47, pp. 85- ,(2010) , 10.1002/MEET.14504701295
Yoav Freund, Robert E Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting conference on learning theory. ,vol. 55, pp. 119- 139 ,(1997) , 10.1006/JCSS.1997.1504
Gregory Ditzler, Robi Polikar, Incremental Learning of Concept Drift from Streaming Imbalanced Data IEEE Transactions on Knowledge and Data Engineering. ,vol. 25, pp. 2283- 2301 ,(2013) , 10.1109/TKDE.2012.136
Lei Yang, Tao Sun, Ming Zhang, Qiaozhu Mei, We know what @you #tag: does the dual role affect hashtag adoption? the web conference. pp. 261- 270 ,(2012) , 10.1145/2187836.2187872
Matthew Karnick, Michael D. Muhlbaier, Robi Polikar, Incremental learning in non-stationary environments with concept drift using a multiple classifier based approach international conference on pattern recognition. pp. 1- 4 ,(2008) , 10.1109/ICPR.2008.4761062
Oren Tsur, Ari Rappoport, What's in a hashtag? Proceedings of the fifth ACM international conference on Web search and data mining - WSDM '12. pp. 643- 652 ,(2012) , 10.1145/2124295.2124320