Nonlinear Algorithms for Static-Rank Computation

作者: Ji-Rong Wen , Yunxiao Ma , Shuming Shi , Bin Lu

DOI:

关键词: Nonlinear systemComputer scienceNonlinear functional analysisPageRankLinear combinationRank (linear algebra)Statistical modelProbabilistic analysis of algorithmsTheoretical computer scienceLinear system

摘要: Mainstream link-based static-rank algorithms (e.g. PageRank and its variants) express the importance of a page as linear combination in-links compute scores by solving system in an iterative way. Such algorithms, however, may give apparently unreasonable staticrank results for some link structures. In this paper, we examine computation problem from viewpoint evidence build probabilistic model it. Based on model, argue that nonlinear formula should be adopted, due to correlation or dependence between links. We focus examining simple formulas which only consider links same domain. Experiments conducted 100 million web pages (with multiple quality evaluation metrics) show higher could yielded new algorithms. The convergence is also proved paper functional analysis.

参考文章(17)
Taher Haveliwala, Sepandar Kamvar, Gene Golub, Christopher Manning, Exploiting the Block Structure of the Web for Computing PageRank Stanford. ,(2003)
Zoltán Gyöngyi, Hector Garcia-Molina, Jan Pedersen, Combating web spam with trustrank very large data bases. pp. 576- 587 ,(2004) , 10.1016/B978-012088469-8.50052-8
Jon M. Kleinberg, Authoritative sources in a hyperlinked environment symposium on discrete algorithms. pp. 668- 677 ,(1998) , 10.5555/314613.315045
Kalervo Järvelin, Jaana Kekäläinen, IR evaluation methods for retrieving highly relevant documents international acm sigir conference on research and development in information retrieval. ,vol. 51, pp. 41- 48 ,(2000) , 10.1145/3130348.3130374
Gui-Rong Xue, Qiang Yang, Hua-Jun Zeng, Yong Yu, Zheng Chen, Exploiting the hierarchical structure for link analysis international acm sigir conference on research and development in information retrieval. pp. 186- 193 ,(2005) , 10.1145/1076034.1076068
Panayiotis Tsaparas, Using non-linear dynamical systems for web searching and ranking Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems - PODS '04. pp. 59- 70 ,(2004) , 10.1145/1055558.1055569
Matthew Richardson, Pedro Domingos, The Intelligent surfer: Probabilistic Combination of Link and Content Information in PageRank neural information processing systems. ,vol. 14, pp. 1441- 1448 ,(2001)
Maurice G. Kendall, Rank correlation methods ,(1948)
A Survey on PageRank Computing Internet Mathematics. ,vol. 2, pp. 73- 120 ,(2005) , 10.1080/15427951.2005.10129098
Baoning Wu, Brian D. Davison, Identifying link farm spam pages the web conference. pp. 820- 829 ,(2005) , 10.1145/1062745.1062762