Proceedings of the 24th International Conference on World Wide Web 2015
DOI: 10.1145/2736277.2741096
|View full text |Cite
|
Sign up to set email alerts
|

Effective Techniques for Message Reduction and Load Balancing in Distributed Graph Computation

Abstract: Massive graphs, such as online social networks and communication networks, have become common today. To efficiently analyze such large graphs, many distributed graph computing systems have been developed. These systems employ the "think like a vertex" programming paradigm, where a program proceeds in iterations and at each iteration, vertices exchange messages with each other. However, using Pregel's simple message passing mechanism, some vertices may send/receive significantly more messages than others due to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
59
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
1
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 102 publications
(59 citation statements)
references
References 21 publications
(33 reference statements)
0
59
0
Order By: Relevance
“…In anticipation of these scenarios, here we design and implement a simple and flexible load sharing procedure which we hope can help to alleviate the level of stress of healthcare systems and implement and test with information for the UK National Health Service (NHS) and the Spanish health system. Graph-embedded load balancing [9,10] has been mainly explored in computer science, usually taking a 'vertex perspective' for graphical computation with the aim of achieving a centralised solution to load allocation, subject to locality and availability constraints [11]. Interestingly, this line usually relates to minimise large-scale computational efforts, rather than actually sharing physical resources.…”
Section: I-backgroundmentioning
confidence: 99%
“…In anticipation of these scenarios, here we design and implement a simple and flexible load sharing procedure which we hope can help to alleviate the level of stress of healthcare systems and implement and test with information for the UK National Health Service (NHS) and the Spanish health system. Graph-embedded load balancing [9,10] has been mainly explored in computer science, usually taking a 'vertex perspective' for graphical computation with the aim of achieving a centralised solution to load allocation, subject to locality and availability constraints [11]. Interestingly, this line usually relates to minimise large-scale computational efforts, rather than actually sharing physical resources.…”
Section: I-backgroundmentioning
confidence: 99%
“…We test Pregel+'s basic implementation, Pregel+'s ghost mode (a.k.a. the mirroring technique [29]), the standard channel version ( Fig. 1) and the scatter-combine channel version.…”
Section: B Effectiveness Of Optimized Channelsmentioning
confidence: 99%
“…While Pregel provides a friendly interface for processing massive graphs, current research shows that it is important to introduce optimizations for dealing with various performance issues such as imbalanced workload (a.k.a. skewed degree distribution) [2], [8], [29], redundancies in communication [3], [19], [29] and low convergence speed [23], [24], [28]. However, there remains one challenge: although the usefulness of these optimizations are well demonstrated in solving simple algorithms such as PageRank and single-source shortest path (SSSP) 1 , it is, however, hard to combine them together to implement complex algorithms, where we may have to deal with multiple performance issues at the same time.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, MOCgraph , GraphD [Yan et al, 2016d], and the superstep-splitting technique of Giraph all propose aggregating messages earlier instead of buffering them for later processing, in order to save memory space; while PowerGraph , GraphChi and X-Stream [Roy et al, 2013] assume that data values are aggregated at each vertex from its incoming edges, in their model design. We, however, would like to indicate that not all algorithms with edge-based communication allow its vertices to aggregate received values, such as the attribute broadcast algorithm of Yan et al [2015].…”
Section: Expressivenessmentioning
confidence: 99%