Proceedings DCC 2001. Data Compression Conference
DOI: 10.1109/dcc.2001.917151
|View full text |Cite
|
Sign up to set email alerts
|

Towards compressing Web graphs

Abstract: In this paper, we consider the problem of compressing graphs of the link structure of the World Wide Web. We provide e cient algorithms for such compression that are motivated by recently proposed random graph models for describing the Web. The algorithms are based on reducing the compression problem to the problem of nding a minimum spanning tree in a directed graph related to the original link graph. The performance of the algorithms on graphs generated by the random graph models suggests that by taking adva… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
115
0

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 148 publications
(115 citation statements)
references
References 13 publications
0
115
0
Order By: Relevance
“…The structural connectivity of the Web modeled as the Web graph is an example which presently contains billions of vertices and the number is growing [1]. As a result, compact representation of such graphs for use in various algorithms has been in interest [2,3,4,5]. Planar (and almost planar) graphs which capture various structural artifacts such as road networks, form another example of graphs whose space-efficient representation is crucial due to their massive size.…”
Section: Introductionmentioning
confidence: 99%
“…The structural connectivity of the Web modeled as the Web graph is an example which presently contains billions of vertices and the number is growing [1]. As a result, compact representation of such graphs for use in various algorithms has been in interest [2,3,4,5]. Planar (and almost planar) graphs which capture various structural artifacts such as road networks, form another example of graphs whose space-efficient representation is crucial due to their massive size.…”
Section: Introductionmentioning
confidence: 99%
“…Most known methods for structural (graph) compression are of heuristic nature. For example, Adler and Mitzenmacher [1] proposed a heuristic method for web graph compression, and similar idea has been used in [21] for compressing sparse graphs. Recently, attention has shifted to grammar compression for data structures: Peshkin [16] proposed an algorithm for a graphical extension of the one-dimensional SE-QUITUR compression method.…”
Section: Introductionmentioning
confidence: 99%
“…1 In 1990, Naor [15] proposed such a representation that is optimal up to the first two leading terms when all unlabeled graphs are equally likely. In this paper, we solve Turan's problem for a larger class of graphs, in particular for the Erdős-Rényi random graphs in which edges are added randomly with probability p. Naor's result is asymptotically a special case of ours when p = 1/2.…”
Section: Introductionmentioning
confidence: 99%
“…The WebGraph compression method is indeed the most successful member of a family of approaches to compress Web graphs based on their statistical properties [5,7,1,23,21,20]. It allows fast extraction of the neighbors of a page while spending just a few bits per link (about 2 to 6, depending on the desired navigation performance).…”
Section: Introductionmentioning
confidence: 99%