Proceedings of the Eleventh International Conference on Information and Knowledge Management 2002
DOI: 10.1145/584792.584882
|View full text |Cite
|
Sign up to set email alerts
|

I/O-efficient techniques for computing pagerank

Abstract: Over the last few years, most major search engines have integrated link-based ranking techniques in order to provide more accurate search results. One widely known approach is the Pagerank technique, which forms the basis of the Google ranking scheme, and which assigns a global importance measure to each page based on the importance of other pages pointing to it. The main advantage of the Pagerank measure is that it is independent of the query posed by a user; this means that it can be precomputed and then use… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
28
0

Year Published

2004
2004
2011
2011

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 47 publications
(29 citation statements)
references
References 48 publications
(49 reference statements)
1
28
0
Order By: Relevance
“…Because pages only link to a few others (the link matrix is sparse), this results in much lower memory requirements of the link structure, in the magnitude of | L | · n −1 (n = average outdegree). Of course, compression techniques [14] or disk-based "swapping" [9,5] can improve the space requirements even further. But with the permanent growth of the web, even such techniques will soon hit memory limits of a single computer, or unacceptably slow down the computation process.…”
Section: Web Graph Representationmentioning
confidence: 99%
See 2 more Smart Citations
“…Because pages only link to a few others (the link matrix is sparse), this results in much lower memory requirements of the link structure, in the magnitude of | L | · n −1 (n = average outdegree). Of course, compression techniques [14] or disk-based "swapping" [9,5] can improve the space requirements even further. But with the permanent growth of the web, even such techniques will soon hit memory limits of a single computer, or unacceptably slow down the computation process.…”
Section: Web Graph Representationmentioning
confidence: 99%
“…We can therefore also write Equation 1 in matrix terms as follows: Equation 1 also represents the linear system representation of this computation using the Jacobi method. This enables the consideration of using other stationary iterative solvers, such as the Gauß-Seidel method, which was said not to be efficiently parallelizable here [1,5]. Actually, there already are parallel Gauss-Seidel implementations for certain scenarios such as the one described in [13], using block-diagonally-bordered matrices; however, they all admit their approach was designed for a static matrix; after each modification, a specific preprocessing (sorting) step is required, which can take longer than the real computation.…”
Section: Pagerankmentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, TSE reads D = 2F bytes and maintains Ω = pF , without ever requiring that all supporter counts fit in RAM. Note that since both BV and QVS are iterative methods similar to PageRank, their Ω can be reduced at the expense of certain offline pre-processing on the graph [8]. We do not consider this in our comparison.…”
Section: External Memorymentioning
confidence: 99%
“…However, as pointed out by [6], even the best compression scheme requires about .6 bytes per hyperlink, which still results in an exceedingly large space requirement. Others are to design I/O-efficient algorithms, such as [9,10], which can handle any size of data without any particular memory requirement. Also aiming at reducing the cost of each iteration, many other works, such as [11,12], combine some linear algebra techniques to reduce the computational cost of each iteration.…”
Section: Related Workmentioning
confidence: 99%