Discovering Opinion Spammer Groups by Network Footprints

作者: Junting Ye , Leman Akoglu

DOI: 10.1145/2817946.2820606

关键词:

摘要: Online reviews are an important source for consumers to evaluate products/services on the Internet (e.g. Amazon, Yelp, etc.). However, more and fraudulent reviewers write fake mislead users. To maximize their impact share effort, many spam attacks organized as campaigns, by a group of spammers. In this paper, we propose new two-step method discover spammer groups targeted products. First, introduce NFS (Network Footprint Score), measure that quantifies likelihood products being campaign targets. Second, carefully devise GroupStrainer cluster spammers 2-hop subgraph induced top ranking Our approach has four key advantages: (i) unsupervised detection; both steps require no labeled data, (ii) adversarial robustness; quantify statistical distortions in review network, which have only partial view, avoid any side information can easily evade, (iii) sensemaking; output facilitates exploration nested hierarchy (i.e., organization) among spammers, finally (iv) scalability; complexity linear network size, moreover, operates subnetwork. We demonstrate efficiency effectiveness our synthetic real-world datasets from two different domains with millions reviewers. Moreover, interesting strategies employ through case studies detected groups.

参考文章(30)
Károly Csalogány, András A. Benczúr, Tamás Sarlós, Máté Uher, SpamRank -- Fully Automatic Link Spam Detection. adversarial information retrieval on the web. pp. 25- 38 ,(2005)
Leman Akoglu, Mary McGlohon, Christos Faloutsos, OddBall: spotting anomalies in weighted graphs knowledge discovery and data mining. ,vol. 6119, pp. 410- 421 ,(2010) , 10.1007/978-3-642-13672-6_40
Piotr Indyk, Aristides Gionis, Rajeev Motwani, Similarity Search in High Dimensions via Hashing very large data bases. pp. 518- 529 ,(1999)
Leman Akoglu, Christos Faloutsos, RTG: A Recursive Realistic Graph Generator Using Random Typing european conference on machine learning. pp. 13- 28 ,(2009) , 10.1007/978-3-642-04180-8_13
Fangtao Li, Xiaoyan Zhu, Minlie Huang, Yi Yang, Learning to identify review spam international joint conference on artificial intelligence. pp. 2488- 2493 ,(2011) , 10.5591/978-1-57735-516-8/IJCAI11-414
Michalis Faloutsos, Petros Faloutsos, Christos Faloutsos, On power-law relationships of the Internet topology acm special interest group on data communication. ,vol. 29, pp. 251- 262 ,(1999) , 10.1145/316188.316229
Chang Xu, Jie Zhang, Kuiyu Chang, Chong Long, Uncovering collusive spammers in Chinese review websites conference on information and knowledge management. pp. 979- 988 ,(2013) , 10.1145/2505515.2505700
Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui Wang, Meichun Hsu, Malu Castellanos, Riddhiman Ghosh, Spotting opinion spammers using behavioral footprints knowledge discovery and data mining. pp. 632- 640 ,(2013) , 10.1145/2487575.2487580
Nitin Jindal, Bing Liu, Opinion spam and analysis web search and data mining. pp. 219- 230 ,(2008) , 10.1145/1341531.1341560
Huayi Li, Zhiyuan Chen, Bing Liu, Xiaokai Wei, Jidong Shao, Spotting Fake Reviews via Collective Positive-Unlabeled Learning 2014 IEEE International Conference on Data Mining. pp. 899- 904 ,(2014) , 10.1109/ICDM.2014.47