Codes for the World Wide Web

Authors:

DOI: 10.1080/15427951.2005.10129113

Keywords:

Abstract: We introduce a new family of simple, complete instantaneous codes for positive integers, called ζ codes, which are suitable for integers distributed as a power law with small exponent (smaller than 2). The main motivation for the introduction of ζ codes comes from web-graph compression: if nodes are numbered according to URL lexicographical order, gaps in successor lists are distributed according to a power law with small exponent. We give estimates of the expected length of ζ codes against power-law distributions, and compare the results with analogous estimates for the more classical γ, δ, and variable-length block codes.
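
The abstract names ζ codes without defining them, so the following is only a minimal sketch under an assumed definition: for shrinking parameter k, a positive integer x in [2^(hk), 2^((h+1)k) − 1] is written as h + 1 in unary followed by a minimal binary code of x − 2^(hk) over an interval of size 2^((h+1)k) − 2^(hk). The function names (`unary`, `minimal_binary`, `zeta`, `gamma`) and the bit-string representation are illustrative choices, not taken from the paper or from any library.

```python
def _bits(value: int, width: int) -> str:
    """Return `value` as a binary string of exactly `width` bits ('' if width is 0)."""
    return format(value, f"0{width}b") if width > 0 else ""


def unary(n: int) -> str:
    """Unary code of a positive integer: n - 1 zeros followed by a one (assumed convention)."""
    return "0" * (n - 1) + "1"


def minimal_binary(x: int, z: int) -> str:
    """Minimal binary code of x in [0, z - 1]; the first 2^s - z values get s - 1 bits."""
    s = (z - 1).bit_length()      # s = ceil(log2(z)) for z >= 1
    short = (1 << s) - z          # how many values receive the shorter codeword
    if x < short:
        return _bits(x, s - 1)
    return _bits(x - z + (1 << s), s)


def zeta(x: int, k: int) -> str:
    """Assumed zeta code with shrinking parameter k for a positive integer x."""
    if x < 1 or k < 1:
        raise ValueError("x and k must be positive")
    h = (x.bit_length() - 1) // k            # x lies in [2^(hk), 2^((h+1)k) - 1]
    lo = 1 << (h * k)
    size = (1 << ((h + 1) * k)) - lo         # number of integers in that interval
    return unary(h + 1) + minimal_binary(x - lo, size)


def gamma(x: int) -> str:
    """Elias gamma code: (bit length - 1) zeros, then x in binary."""
    if x < 1:
        raise ValueError("x must be positive")
    return "0" * (x.bit_length() - 1) + format(x, "b")


if __name__ == "__main__":
    for x in (1, 2, 3, 4, 16):
        print(x, gamma(x), zeta(x, 2), zeta(x, 3))
```

Under this definition, k = 1 reproduces the γ code, while larger k spends fewer bits on the unary part for large values, which is the trade-off the abstract refers to for power laws with small exponent.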
