Detecting Hotspot Information Using Multi-Attribute Based Topic Model.

作者: Jing Wang , Li Li , Feng Tan , Ying Zhu , Weisi Feng

DOI: 10.1371/JOURNAL.PONE.0140539

关键词:

摘要: Microblogging as a kind of social network has become more and important in our daily lives. Enormous amounts information are produced shared on basis. Detecting hot topics the mountains can help people get to essential quickly. However, due short sparse features, large number meaningless tweets other characteristics microblogs, traditional topic detection methods often ineffective detecting topics. In this paper, we propose new model named multi-attribute latent dirichlet allocation (MA-LDA), which time hashtag attributes microblogs incorporated into LDA model. By introducing attribute, MA-LDA decide whether word should appear or not. Meanwhile, compared with model, applying attribute gives core words an artificially high ranking results meaning expressiveness outcomes be improved. Empirical evaluations real data sets demonstrate that method is able detect accurately efficiently several baselines. Our provides strong evidence importance temporal factor extracting

参考文章(40)
R. Papka, J. Allan, On-Line New Event Detection using Single Pass Clustering TITLE2: University of Massachusetts. ,(1998)
Ying Zhu, Li Li, Le Luo, Learning to Classify Short Text with Topic Model and External Knowledge Knowledge Science, Engineering and Management. pp. 493- 503 ,(2013) , 10.1007/978-3-642-39787-5_41
Juanzi Li, Siqiang Wen, Zhixing Li, Jie Tang, Peng Zhang, On Modelling Non-linear Topical Dependencies international conference on machine learning. pp. 458- 466 ,(2014)
Bo Huang, Yan Yang, Amjad Mahmood, Hongjun Wang, Microblog Topic Detection Based on LDA Model and Single-Pass Clustering International Conference on Rough Sets and Current Trends in Computing. pp. 166- 171 ,(2012) , 10.1007/978-3-642-32115-3_19
Fabian Abel, Qi Gao, Geert-Jan Houben, Ke Tao, Semantic Enrichment of Twitter Posts for User Profile Construction on the Social Web The Semanic Web: Research and Applications. pp. 375- 389 ,(2011) , 10.1007/978-3-642-21064-8_26
Marzena Kryszkiewicz, Marcin Szczuka, Sheela Ramanna, Qinghua Hu, Richard Jensen, Rough Sets and Current Trends in Computing ,(2011)
Zhen Wang, Satoshi Kokubo, Marko Jusup, Jun Tanimoto, Universal scaling for the dilemma strength in evolutionary games. Physics of Life Reviews. ,vol. 14, pp. 1- 30 ,(2015) , 10.1016/J.PLREV.2015.04.033
W. R. Gilks, Markov Chain Monte Carlo Encyclopedia of Biostatistics. ,(2005) , 10.1002/0470011815.B2A14021
Khoo Khyou Bun, M. Ishizuka, Topic extraction from news archive using TF*PDF algorithm web information systems engineering. pp. 73- 82 ,(2002) , 10.1109/WISE.2002.1181645
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937