Exploring Linkability of User Reviews

作者: Mishari Almishari , Gene Tsudik

DOI: 10.1007/978-3-642-33167-1_18

关键词: Simple (philosophy)Simple FeaturesWorld Wide WebLarge set (Ramsey theory)Set (abstract data type)Computer science

摘要: Large numbers of people all over the world read and contribute to various review sites. Many contributors are understandably concerned about privacy in general and, specifically, linkability their reviews (and accounts) across multiple In this paper, we study community-based reviewing try answer question: what extent ”anonymous” linkable, i.e., highly likely authored by same contributor? Based on a very large set from one popular site (Yelp), show that high percentage ostensibly anonymous can be accurately linked authors. This is despite fact use simple models equally features set. Our suggests reliably expose identities reviews. has important implications for cross-referencing accounts between different Also, techniques used our could adopted sites give feedback

参考文章(18)
William Aiello, Andrew Warfield, Mihir Nanavati, Nathan Taylor, Herbert west: deanonymizer usenix conference on hot topics in security. pp. 6- 6 ,(2011)
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)
David D. Lewis, Naive (Bayes) at forty: The independence assumption in information retrieval Machine Learning: ECML-98. pp. 4- 15 ,(1998) , 10.1007/BFB0026666
Dan Frankowski, Dan Cosley, Shilad Sen, Loren Terveen, John Riedl, You are what you say Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '06. pp. 565- 572 ,(2006) , 10.1145/1148170.1148267
Sadia Afroz, Michael Brennan, Rachel Greenstadt, Detecting Hoaxes, Frauds, and Deception in Writing Style Online ieee symposium on security and privacy. pp. 461- 475 ,(2012) , 10.1109/SP.2012.34
Nitin Jindal, Bing Liu, Opinion spam and analysis web search and data mining. pp. 219- 230 ,(2008) , 10.1145/1341531.1341560
Arvind Narayanan, Hristo Paskov, Neil Zhenqiang Gong, John Bethencourt, Emil Stefanov, Eui Chul Richard Shin, Dawn Song, On the Feasibility of Internet-Scale Author Identification ieee symposium on security and privacy. pp. 300- 314 ,(2012) , 10.1109/SP.2012.46
Nitin Jindal, Bing Liu, Ee-Peng Lim, Finding unusual review patterns using unexpected rules Proceedings of the 19th ACM international conference on Information and knowledge management - CIKM '10. pp. 1549- 1552 ,(2010) , 10.1145/1871437.1871669
Ahmed Abbasi, Hsinchun Chen, Writeprints ACM Transactions on Information Systems. ,vol. 26, pp. 1- 29 ,(2008) , 10.1145/1344411.1344413
Kushal Dave, Steve Lawrence, David M. Pennock, Mining the peanut gallery Proceedings of the twelfth international conference on World Wide Web - WWW '03. pp. 519- 528 ,(2003) , 10.1145/775152.775226