2011
DOI: 10.1007/978-3-642-21286-4_5
|View full text |Cite
|
Sign up to set email alerts
|

Quick Detection of Top-k Personalized PageRank Lists

Abstract: Abstract. We study a problem of quick detection of top-k Personalized PageRank (PPR) lists. This problem has a number of important applications such as finding local cuts in large graphs, estimation of similarity distance and person name disambiguation. We argue that two observations are important when finding top-k PPR lists. Firstly, it is crucial that we detect fast the top-k most important neighbors of a node, while the exact order in the top-k list and the exact values of PPR are by far not so crucial. Se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
36
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 44 publications
(36 citation statements)
references
References 19 publications
0
36
0
Order By: Relevance
“…Moreover, we also show that -these can be obtained highly efficiently, if necessary, leveraging existing approximation algorithms [2,4,14,17,21,23,41] and/or parallel implementations [3,32] for computing the PPR scores, -the proposed formulations are reuse-promoting in the sense that, it is possible to divide the work relative to individual seed nodes and cache the intermediary results obtained during the computation -these cached results can then be reused for future queries sharing seed nodes, and -especially in systems with large query throughputs, it may be possible to cluster queries based on the partial overlaps between the seed sets and, thus, significantly reduce the overall robust PPR computation costs.…”
Section: Our Contributions: Robust Personalized Pagerank (Rpr)mentioning
confidence: 99%
See 2 more Smart Citations
“…Moreover, we also show that -these can be obtained highly efficiently, if necessary, leveraging existing approximation algorithms [2,4,14,17,21,23,41] and/or parallel implementations [3,32] for computing the PPR scores, -the proposed formulations are reuse-promoting in the sense that, it is possible to divide the work relative to individual seed nodes and cache the intermediary results obtained during the computation -these cached results can then be reused for future queries sharing seed nodes, and -especially in systems with large query throughputs, it may be possible to cluster queries based on the partial overlaps between the seed sets and, thus, significantly reduce the overall robust PPR computation costs.…”
Section: Our Contributions: Robust Personalized Pagerank (Rpr)mentioning
confidence: 99%
“…This is especially advantageous when G is large as we can leverage any of the highly effective approximation algorithms [2,4,14,17,21,23,41] or parallelized implementations [3,32] for computing these PPR scores. Most importantly, the first step of the algorithm (where we solve a linear equation independently for each seed node) can be trivially parallelized by assigning each node to a different computation unit.…”
Section: Converting the Problem Into A Set Of Linear Equationsmentioning
confidence: 99%
See 1 more Smart Citation
“…The number n = |V | (m = |E|) of nodes (edges) of each graph are shown in Table 2. Web-stanford-cs 3 and Web-stanford 4 were crawled from stanford.edu. Each node is a web domain and a directed link stands for a hyperlink between two nodes.…”
Section: Datasetsmentioning
confidence: 99%
“…The top-k RWR proximity query retrieves the k nodes with the highest proximity from a given query node q in a graph. This problem has been investigated previously and efficient solutions have been proposed for it (e.g., [11,3,10]). …”
Section: Introductionmentioning
confidence: 99%