GSLDA: LDA-based group spamming detection in product reviews

作者: Zhuo Wang , Songmin Gu , Xiaowei Xu

DOI: 10.1007/S10489-018-1142-1

关键词: Function (engineering)Information retrievalLatent Dirichlet allocationThe InternetQuality (business)Field (computer science)Context (language use)Order (business)SpammingComputer science

摘要: Online product reviews are becoming increasingly important due to their guidance function in people’s purchase decisions. As being highly subjective, online subject opinion spamming, i.e., fraudsters write fake or give unfair ratings promote demote target products. Although there have been much efforts this field, the problem is still left open difficulties gathering ground-truth data. more and people using Internet everyday life, group review which involves a of writing hype-reviews (promote) defaming-reviews (demote) for one products, becomes main form spamming. In paper, we propose LDA-based computing framework, namely GSLDA, spamming detection completely unsupervised approach, GSLDA works two phases. It first adapts LDA (Latent Dirichlet Allocation) context order bound closely related spammers into small-sized reviewer cluster, then it extracts high suspicious groups from each LDA-clusters. Experiments on three real-world datasets show that can detect quality spammer groups, outperforming many state-of-the-art baselines terms accuracy.

参考文章(26)
Euijin Choo, Ting Yu, Min Chi, Detecting Opinion Spammer Groups Through Community Discovery and Sentiment Analysis 29th IFIP Annual Conference on Data and Applications Security and Privacy (DBSEC). pp. 170- 187 ,(2015) , 10.1007/978-3-319-20810-7_11
Vivek Venkataraman, Arjun Mukherjee, Bing Liu, Natalie S. Glance, What Yelp Fake Review Filter Might Be Doing international conference on weblogs and social media. pp. 409- 418 ,(2013)
Michael Crawford, Taghi M. Khoshgoftaar, Joseph D. Prusa, Aaron N. Richter, Hamzah Al Najada, Survey of review spam detection using machine learning techniques Journal of Big Data. ,vol. 2, pp. 23- ,(2015) , 10.1186/S40537-015-0029-9
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Junting Ye, Leman Akoglu, Discovering Opinion Spammer Groups by Network Footprints conference on online social networks. pp. 97- 97 ,(2015) , 10.1145/2817946.2820606
Chang Xu, Jie Zhang, Kuiyu Chang, Chong Long, Uncovering collusive spammers in Chinese review websites conference on information and knowledge management. pp. 979- 988 ,(2013) , 10.1145/2505515.2505700
Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui Wang, Meichun Hsu, Malu Castellanos, Riddhiman Ghosh, Spotting opinion spammers using behavioral footprints knowledge discovery and data mining. pp. 632- 640 ,(2013) , 10.1145/2487575.2487580
Nitin Jindal, Bing Liu, Opinion spam and analysis web search and data mining. pp. 219- 230 ,(2008) , 10.1145/1341531.1341560
Shebuti Rayana, Leman Akoglu, Collective Opinion Spam Detection: Bridging Review Networks and Metadata knowledge discovery and data mining. pp. 985- 994 ,(2015) , 10.1145/2783258.2783370
Arjun Mukherjee, Bing Liu, Natalie Glance, Spotting fake reviewer groups in consumer reviews the web conference. pp. 191- 200 ,(2012) , 10.1145/2187836.2187863