Detection of undesirable web pages

作者: Kostas Tsioutsiouliklis , Dmitri Pavlovski , Su Han Chan , Lei Duan , Gilbert Leung

DOI:

关键词: ComputationSearch engineLog dataData miningMathematicsPartition (database)StatisticBacklinkTheoretical computer scienceWeb search queryWeb page

摘要: A system for detecting artificial promotion of a resource, including search engine operative to index set incoming links (“inlinks”) which reference the log module coupled with and configured store data associated inlinks, partitioning partition inlinks into plurality groups based on at least one scheme, statistics compute statistic within each computation process computed metric where indicates level uniformity distribution values respective among places list results, generated in response query, pattern metric.

参考文章(11)
Brian D. Davia, Stephen Michael McKain, Colin E. Birge, Paul W. Coleman, Method for web page rules compliance testing ,(2003)
Alexandros Ntoulas, Dennis Craig Fetterly, Marc Alexander Najork, Mark Steven Manasse, Using content analysis to detect spam web pages ,(2005)
Rongbo Du, R. Safavi-Naini, W. Susilo, Web filtering using text classification international conference on networks. pp. 325- 330 ,(2003) , 10.1109/ICON.2003.1266211
Jennifer Tour Chayes, Seyed Vahab Mirrokni, Christian Herwarth Borgs, John E Hopcroft, Shang-Hua Teng, Reid Marlow Andersen, Jain Kamal, Amit Prakash, Locally computable spam detection features and robust pagerank ,(2008)
Zoltan Istvan Gyongyi, Pavel Berkhin, Jan Pedersen, Link-based spam detection ,(2005)
Taher H. Haveliwala, Sepandar D. Kamvar, Glen M. Jeh, Method for detecting link spam in hyperlinked databases ,(2011)
Michael A. Paolini, Michael Wayne Brown, Kelvin Roderick Lawrence, Web page thumbnails and user configured complementary information provided from a server ,(2003)
Tie-Yan Liu, Hang Li, Congkai Sun, Bin Gao, Forum mining for suspicious link spam sites detection ,(2008)
Harry R. Halpin, Henry S. Thompson, Distributed human improvement of search engine results ,(2007)