Minimizing Communication in All-Pairs Shortest Paths

Solomonik, Edgar; Buluç, Aydın; Demmel, James

doi:10.21236/ada580350

Cited by 18 publications

(13 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In [19], Solomonik et al propose recursive 2D block-cyclic algorithm that achieves lower-bound on communication latency and bandwidth, which they next extend to 2.5D communication avoiding formulation. On a machine with 24,576 cores, the algorithms maintain strong scaling for problems with n = 32768 nodes, and weak scaling for up to n = 131072 nodes.…”

Section: Related Workmentioning

confidence: 99%

“…The paper is also noteworthy for its extensive review of distributed memory APSP solvers. The advantage of the recursive formulation is induced data locality, which directly contributes to improved performance [19]. However, in Spark, the concept of data locality is much weaker, since Spark's runtime system has a significant freedom in scheduling where to materialize or move data for computations.…”

Section: Related Workmentioning

confidence: 99%

“…The standard approach to handle APSP in such cases is to use a variant of classic Floyd-Warshall [5] or Johnson algorithms [5], with complexity O(|V | 3 ) and O(|V ||E| + |V | 2 log(|V |)), respectively. The Johnson algorithm offers better asymptotic behaviour for reasonably sparse graphs, but typically Floyd-Warshall derivatives outperform it as they allow for better computational density (see also [19]). When using Floyd-Warshall and related algorithms, we will represent adjacency matrix of G by A, where A i j = A ji = w i j stores the weight of edge between vertices with indices i and j.…”

Section: Preliminariesmentioning

confidence: 99%

“…The second method, abbreviated as DC-GbE, is highly optimized divide-and-conquer solver by Solomonik et al [19], available from https://github.com/solomonik/APSP. The solver has been demonstrated to scale extremely well to very large parallel machines, and is the state-of-the-art HPC solution.…”

Section: Comparison With Mpi-based Solversmentioning

confidence: 99%

See 3 more Smart Citations

Solving All-Pairs Shortest-Paths Problem in Large Graphs Using Apache Spark

Schoeneman

Żola

2019

Proceedings of the 48th International Conference on Parallel Processing

View full text Add to dashboard Cite

Algorithms for computing All-Pairs Shortest-Paths (APSP) are critical building blocks underlying many practical applications. The standard sequential algorithms, such as Floyd-Warshall and Johnson, quickly become infeasible for large input graphs, necessitating parallel approaches. In this work, we propose, implement and thoroughly analyse different strategies for APSP on distributed memory clusters with Apache Spark. Our solvers are designed for large undirected weighted graphs, and differ in complexity and degree of reliance on techniques outside of pure Spark API. We demonstrate that the best performing solver is able to handle APSP problems with over 200,000 vertices on a 1024-core cluster. However, it requires auxiliary shared persistent storage to compensate for missing Spark functionality.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Preliminariesmentioning

confidence: 99%

Section: Comparison With Mpi-based Solversmentioning

confidence: 99%

See 2 more Smart Citations

Solving All-Pairs Shortest-Paths Problem in Large Graphs Using Apache Spark

Schoeneman

Żola

2019

Proceedings of the 48th International Conference on Parallel Processing

View full text Add to dashboard Cite

show abstract

“…Irony et al [21] used a geometric reasoning with the Loomis-Whitney inequality [23] to present an alternate proof to Hong and Kung's [20] for I/O lower bounds on standard matrix multiplication. More recently, Demmel's group at UC Berkeley has developed lower bounds as well as optimal algorithms for several linear algebra computations including QR and LU decomposition and the all-pairs shortest paths problem [1,3,13,33].…”

Section: Related Workmentioning

confidence: 99%

On Characterizing the Data Access Complexity of Programs

et al. 2015

View full text Add to dashboard Cite

Technology trends will cause data movement to account for the majority of energy expenditure and execution time on emerging computers. Therefore, computational complexity will no longer be a sufficient metric for comparing algorithms, and a fundamental characterization of data access complexity will be increasingly important. The problem of developing lower bounds for data access complexity has been modeled using the formalism of Hong & Kung's red/blue pebble game for computational directed acyclic graphs (CDAGs). However, previously developed approaches to lower bounds analysis for the red/blue pebble game are very limited in effectiveness when applied to CDAGs of real programs, with computations comprised of multiple sub-computations with differing DAG structure. We address this problem by developing an approach for effectively composing lower bounds based on graph decomposition. We also develop a static analysis algorithm to derive the asymptotic data-access lower bounds of programs, as a function of the problem size and cache size.

show abstract