Effective Techniques for Message Reduction and Load Balancing in Distributed Graph Computation

Yan, Da; Cheng, James; Lu, Yi; Ng, Wilfred

doi:10.1145/2736277.2741096

Cited by 102 publications

(59 citation statements)

References 21 publications

(33 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In anticipation of these scenarios, here we design and implement a simple and flexible load sharing procedure which we hope can help to alleviate the level of stress of healthcare systems and implement and test with information for the UK National Health Service (NHS) and the Spanish health system. Graph-embedded load balancing [9,10] has been mainly explored in computer science, usually taking a 'vertex perspective' for graphical computation with the aim of achieving a centralised solution to load allocation, subject to locality and availability constraints [11]. Interestingly, this line usually relates to minimise large-scale computational efforts, rather than actually sharing physical resources.…”

Section: I-backgroundmentioning

confidence: 99%

A flexible method for optimising sharing of healthcare resources and demand in the context of the COVID-19 pandemic

Lacasa

Challen

Brooks-Pollock

et al. 2020

Preprint

View full text Add to dashboard Cite

As the number of cases of COVID-19 continues to grow exponentially, local health services are likely to be overwhelmed with patients requiring intensive care. We develop and implement an algorithm to provide optimal re-routing strategies to either transfer patients requiring Intensive Care Units (ICU) or ventilators, constrained by feasibility of transfer. We validate our approach with realistic data extracted from UK and Spain. For the UK case, we coarse-grain the NHS system at the level of NHS trusts and, subsequently cover the whole set of geopositioned trusts to extract a 4-regular geometric graph which indicates, for a given trust, its four nearest neighbors. The Spanish case is analysed at the autonomous community level, and we extract a contact network where nodes correspond to autonomous communities and links indicate adjacent communities. Estimates of weekly ICU demand could be extrapolated from an age structured epidemiological model by considering contagion-to-ICU likelihood estimates or alternatively from available data. Through random search optimisation we identify the best load sharing strategy, where the cost function to minimise is based on the total number of ICU units above capacity and we implement and test two optimisation strategies. Our framework is flexible allowing for additional criteria, different cost functions, and this methodology is general enough that it can easily be extended to optimise other resources beyond ICU units or ventilators. Assuming a uniform ICU demand across trusts, we show that using our method it is possible to enable access to ICU treatment to up to 1000 cases in the UK in a single step of the algorithm, and with more realistic demand the algorithm is able to balance about 600 beds per step in the Spanish system -leading to potentially saving a large percentage of these lives that would otherwise not have access to ICU if no load sharing was implemented.

show abstract

Section: I-backgroundmentioning

confidence: 99%

A flexible method for optimising sharing of healthcare resources and demand in the context of the COVID-19 pandemic

Lacasa

Challen

Brooks-Pollock

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

“…We test Pregel+'s basic implementation, Pregel+'s ghost mode (a.k.a. the mirroring technique [29]), the standard channel version ( Fig. 1) and the scatter-combine channel version.…”

Section: B Effectiveness Of Optimized Channelsmentioning

confidence: 99%

“…While Pregel provides a friendly interface for processing massive graphs, current research shows that it is important to introduce optimizations for dealing with various performance issues such as imbalanced workload (a.k.a. skewed degree distribution) [2], [8], [29], redundancies in communication [3], [19], [29] and low convergence speed [23], [24], [28]. However, there remains one challenge: although the usefulness of these optimizations are well demonstrated in solving simple algorithms such as PageRank and single-source shortest path (SSSP) 1 , it is, however, hard to combine them together to implement complex algorithms, where we may have to deal with multiple performance issues at the same time.…”

Section: Introductionmentioning

confidence: 99%

“…Essentially, it is an iterative algorithm with two key operations -pointer jumping and tree merging. For the pointer jumping operation, the communication suffers from imbalanced workload [29], and in the meantime in the tree merging operation, the neighborhood communication (every vertex broadcasts a message to all of its own neighbors) could be potentially very heavy. Although there are techniques in separate systems [3], [19], [29] dealing with each case, there is no system capable of optimizing away both issues at the same time.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Composing Optimization Techniques for Vertex-Centric Graph Processing via Communication Channels

Zhang

2019

2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

View full text Add to dashboard Cite

Pregel's vertex-centric model allows us to implement many interesting graph algorithms, where optimization plays an important role in making it practically useful. Although many optimizations have been developed for dealing with different performance issues, it is hard to compose them together to optimize complex algorithms, where we have to deal with multiple performance issues at the same time. In this paper, we propose a new approach to composing optimizations, by making use of the channel interface, as a replacement of Pregel's message passing and aggregator mechanism, which can better structure the communication in Pregel algorithms. We demonstrate that it is convenient to optimize a Pregel program by simply using a proper channel from the channel library or composing them to deal with multiple performance issues. We intensively evaluate the approach through many nontrivial examples. By adopting the channel interface, our system achieves an all-around performance gain for various graph algorithms. In particular, the composition of different optimizations makes the S-V algorithm 2.20x faster than the current best implementation.

show abstract

“…For example, MOCgraph , GraphD [Yan et al, 2016d], and the superstep-splitting technique of Giraph all propose aggregating messages earlier instead of buffering them for later processing, in order to save memory space; while PowerGraph , GraphChi and X-Stream [Roy et al, 2013] assume that data values are aggregated at each vertex from its incoming edges, in their model design. We, however, would like to indicate that not all algorithms with edge-based communication allow its vertices to aggregate received values, such as the attribute broadcast algorithm of Yan et al [2015].…”

Section: Expressivenessmentioning

confidence: 99%

Big Graph Analytics Platforms

Yan

Bu²,

Tian

et al. 2017

FNT in Databases

Self Cite

View full text Add to dashboard Cite

Effective Techniques for Message Reduction and Load Balancing in Distributed Graph Computation

Cited by 102 publications

References 21 publications

A flexible method for optimising sharing of healthcare resources and demand in the context of the COVID-19 pandemic

A flexible method for optimising sharing of healthcare resources and demand in the context of the COVID-19 pandemic

Composing Optimization Techniques for Vertex-Centric Graph Processing via Communication Channels

Big Graph Analytics Platforms

Contact Info

Product

Resources

About