2013
DOI: 10.21236/ada580350
|View full text |Cite
|
Sign up to set email alerts
|

Minimizing Communication in All-Pairs Shortest Paths

Abstract: Abstract-We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling the APSP problem to high concurrencies requires both minimizing inter-processor communication as well as maximizing temporal data locality. The 2.5D APSP algorithm, which is based on the divide-andconquer paradigm, satisfies both of these requirements: it can utilize any extra available memory to perform asymptotically less communication, and it is rich in semiring matrix multiplications, which have high… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 18 publications
(13 citation statements)
references
References 17 publications
0
13
0
Order By: Relevance
“…In [19], Solomonik et al propose recursive 2D block-cyclic algorithm that achieves lower-bound on communication latency and bandwidth, which they next extend to 2.5D communication avoiding formulation. On a machine with 24,576 cores, the algorithms maintain strong scaling for problems with n = 32768 nodes, and weak scaling for up to n = 131072 nodes.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…In [19], Solomonik et al propose recursive 2D block-cyclic algorithm that achieves lower-bound on communication latency and bandwidth, which they next extend to 2.5D communication avoiding formulation. On a machine with 24,576 cores, the algorithms maintain strong scaling for problems with n = 32768 nodes, and weak scaling for up to n = 131072 nodes.…”
Section: Related Workmentioning
confidence: 99%
“…The paper is also noteworthy for its extensive review of distributed memory APSP solvers. The advantage of the recursive formulation is induced data locality, which directly contributes to improved performance [19]. However, in Spark, the concept of data locality is much weaker, since Spark's runtime system has a significant freedom in scheduling where to materialize or move data for computations.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Irony et al [21] used a geometric reasoning with the Loomis-Whitney inequality [23] to present an alternate proof to Hong and Kung's [20] for I/O lower bounds on standard matrix multiplication. More recently, Demmel's group at UC Berkeley has developed lower bounds as well as optimal algorithms for several linear algebra computations including QR and LU decomposition and the all-pairs shortest paths problem [1,3,13,33].…”
Section: Related Workmentioning
confidence: 99%