Compact representation of Web graphs with extended functionality

作者: Nieves R. Brisaboa , Susana Ladra , Gonzalo Navarro

DOI: 10.1016/J.IS.2013.08.003

关键词:

摘要: The representation of large subsets the World Wide Web in form a directed graph has been extensively used to analyze structure, behavior, and evolution those so-called graphs. However, interesting graphs are very their classical representations do not fit into main memory typical computers, whereas required algorithms perform inefficiently on secondary memory. Compressed drastically reduce space requirements while allowing efficient navigation compressed form. While most basic operation is retrieve successors node, several important require support for extended queries, such as finding predecessors checking presence link, or retrieving links between ranges nodes. Those seldom supported by representations. This paper presents k^2-tree, novel based compact tree structure that takes advantage empty areas adjacency matrix graph. only retrieves symmetric fashion, but also it particularly check specific nodes, list ranges. Compared best literature supporting successor predecessor our technique offers least usage (1-3 bits per link) fast ([email protected] neighbor retrieved) sharply outperforming others queries. general interest can be compress other kinds data structures.

参考文章(56)
Stefano Millozzi, Stefano Leonardi, Debora Donato, Panayiotis Tsaparas, Mining the inner structure of the Web graph. international workshop on the web and databases. pp. 145- 150 ,(2005)
Károly Csalogány, András A. Benczúr, Tamás Sarlós, Máté Uher, SpamRank -- Fully Automatic Link Spam Detection. adversarial information retrieval on the web. pp. 25- 38 ,(2005)
Simon Gog, Matthias Petri, Optimized succinct data structures for massive data Software - Practice and Experience. ,vol. 44, pp. 1287- 1314 ,(2014) , 10.1002/SPE.2198
Diego Arroyuelo, Kunihiko Sadakane, Rodrigo Cánovas, Gonzalo Navarro, Succinct trees in practice algorithm engineering and experimentation. pp. 84- 97 ,(2010)
Nieves R. Brisaboa, Susana Ladra, Gonzalo Navarro, k2-Trees for Compact Web Graph Representation string processing and information retrieval. ,vol. 5721, pp. 18- 30 ,(2009) , 10.1007/978-3-642-03784-9_3
Francisco Claude, Gonzalo Navarro, Extended compact web graph representations Lecture Notes in Computer Science. ,vol. 6060, pp. 77- 91 ,(2010) , 10.1007/978-3-642-12476-1_5
Nieves R. Brisaboa, Rodrigo Cánovas, Francisco Claude, Miguel A. Martínez-Prieto, Gonzalo Navarro, Compressed string dictionaries symposium on experimental and efficient algorithms. pp. 136- 147 ,(2011) , 10.1007/978-3-642-20662-7_12
Jérémy Barbay, Francisco Claude, Gonzalo Navarro, Compact rich-functional binary relation representations latin american symposium on theoretical informatics. pp. 170- 183 ,(2010) , 10.1007/978-3-642-12200-2_17