Handling Topic Drift for Topic Tracking in Microblogs

作者: Yue Fei , Yihong Hong , Jianwu Yang

DOI: 10.1007/978-3-319-16354-3_52

关键词:

摘要: Microblogs such as Twitter have become an increasingly popular source of real-time information, where users may demand tracking the development topics they are interested in. We approach problem by adapting effective classifier based on Binomial Logistic Regression, which has shown to be state-of-art in traditional news filtering. In our adaptation, we utilize link information enrich tweets’ content and social symbols help estimate quality. Moreover, find that very likely drift microblogs a result redundancy topic divergence tweets. To handle over time, adopt cluster-based subtopic detection algorithm identify whether occurs detected is regarded current focus general adjust drift. Experimental results corpus TREC2012 Microblog Track show achieves remarkable performance both T11SU F-0.5 metrics.

参考文章(26)
Ralf Klinkenberg, Lehrstuhl Informatik Viii, Daimler-Benz Ag, Ingrid Renz, Adaptive Information Filtering: Learning in the Presence of Concept Drifts ,(1998)
Olfa Nasraoui, Bamshad Mobasher, John Yen, Lee Giles, Andrew McCallum, Jaideep Srivastava, Haizheng Zhang, Myra Spiliopoulou, Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis knowledge discovery and data mining. ,(2007)
Stephen Robertson, Threshold Setting and Performance Optimization in Adaptive Filtering Information Retrieval. ,vol. 5, pp. 239- 256 ,(2002) , 10.1023/A:1015702129514
Hinrich Schütze, Christopher D. Manning, Prabhakar Raghavan, Introduction to Information Retrieval ,(2005)
Xiaoming Zhang, Zhoujun Li, Automatic Topic Detection with an Incremental Clustering Algorithm Web Information Systems and Mining. pp. 344- 351 ,(2010) , 10.1007/978-3-642-16515-3_43
Craig Macdonald, Iadh Ounis, Voting for candidates Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06. pp. 387- 396 ,(2006) , 10.1145/1183614.1183671
Feng Liang, Runwei Qiang, Jianwu Yang, Exploiting real-time information retrieval in the microblogosphere acm/ieee joint conference on digital libraries. pp. 267- 276 ,(2012) , 10.1145/2232817.2232867
Frank E. Grubbs, Procedures for Detecting Outlying Observations in Samples Technometrics. ,vol. 11, pp. 1- 21 ,(1969) , 10.1080/00401706.1969.10490657
James Allan, Incremental relevance feedback for information filtering international acm sigir conference on research and development in information retrieval. pp. 270- 278 ,(1996) , 10.1145/243199.243274