Proceedings DCC 2002. Data Compression Conference
DOI: 10.1109/dcc.2002.999950
|View full text |Cite
|
Sign up to set email alerts
|

The Link Database: fast access to graphs of the Web

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
51
0

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 66 publications
(52 citation statements)
references
References 10 publications
1
51
0
Order By: Relevance
“…It can be easily seen that including dangling nodes does not add any storage overhead since a dangling node does not appear in the link file as a source id. Thus, our approach does not change the I/O cost model in (6).…”
Section: Dangling Nodes Treatmentmentioning
confidence: 99%
See 2 more Smart Citations
“…It can be easily seen that including dangling nodes does not add any storage overhead since a dangling node does not appear in the link file as a source id. Thus, our approach does not change the I/O cost model in (6).…”
Section: Dangling Nodes Treatmentmentioning
confidence: 99%
“…Because the major obstacle of speeding up each iteration lies in the size of data, it is natural to compress the data such that it fits into the main memory, as by [6,7,8]. However, as pointed out by [6], even the best compression scheme requires about .6 bytes per hyperlink, which still results in an exceedingly large space requirement. Others are to design I/O-efficient algorithms, such as [9,10], which can handle any size of data without any particular memory requirement.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The WebGraph compression method is indeed the most successful member of a family of approaches to compress Web graphs based on their statistical properties [5,7,1,23,21,20]. It allows fast extraction of the neighbors of a page while spending just a few bits per link (about 2 to 6, depending on the desired navigation performance).…”
Section: Introductionmentioning
confidence: 99%
“…Since disk access is (by five orders of magnitude) slower than main memory access, this leads to unacceptable retrieval times. To mitigate this problem, several compression techniques have been proposed for large graphs, aimed at reducing the number of bits per link required in graph representation [1][2][3][4][5][6][7].…”
Section: Introductionmentioning
confidence: 99%