Finding and fighting search engine spam

作者: Baoning Wu

DOI:

关键词:

摘要: Web surfers rely on search engines to find information from the web. Search engine spam is attempt deceive ranking algorithms and considered by experts well-known companies be one of major challenges today. Without taking action, results will greatly harmed. This dissertation explores in detail effective solutions some techniques, such as link farms cloaking. Our approaches can effectively nullify farm effect our precision outperforms standard based algorithm, HITS, more than 200%. approach detecting cloaking behavior achieve an accuracy 90.5% a recall 86.8%. In addition, this studies idea combining topicality with trust demote spam. Experimental show that we up 43% TrustRank. We also investigate method authority improve quality. Experimental indicate significantly quality

参考文章(107)
Malik Magdon-Ismail, Sibel Adali, Tina Liu, Optimal Link Bombs are Uncoordinated. adversarial information retrieval on the web. pp. 58- 69 ,(2005)
Baoning Wu, Brian D. Davison, Cloaking and Redirection: A Preliminary Study. adversarial information retrieval on the web. pp. 7- 16 ,(2005)
Ricardo A. Baeza-Yates, Carlos Castillo, Vicente López, Pagerank Increase under Different Collusion Topologies. adversarial information retrieval on the web. pp. 17- 24 ,(2005)
Pranam Kolari, Akshay Java, Tim Finin, Anupam Joshi, Tim Oates, Detecting spam blogs: a machine learning approach national conference on artificial intelligence. pp. 1351- 1356 ,(2006) , 10.13016/M27M0444D
Mitsunori Ogihara, Mohammed J. Zaki, Theoretical Foundations of Association Rules ,(2007)
Pranam Kolari, Tim Finin, Anupam Joshi, SVMs for the Blogosphere: Blog Identification and Splog Detection national conference on artificial intelligence. pp. 92- 99 ,(2006)
Vinay Goel, Baoning Wu, Brian D. Davison, Propagating Trust and Distrust to Demote Web Spam. MTW. ,(2006)
Károly Csalogány, András A. Benczúr, Tamás Sarlós, Máté Uher, SpamRank -- Fully Automatic Link Spam Detection. adversarial information retrieval on the web. pp. 25- 38 ,(2005)
Bernard J. Jansen, Adversarial Information Retrieval Aspects of Sponsored Search. adversarial information retrieval on the web. pp. 33- 36 ,(2006)