2020
DOI: 10.48550/arxiv.2010.05975
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal LU Factorization

Grzegorz Kwasniewski,
Tal Ben-Nun,
Alexandros Nikolaos Ziogas
et al.

Abstract: Dense linear algebra kernels, such as linear solvers or tensor contractions, are fundamental components of many scientific computing applications. In this work we present a novel method of deriving parallel I/O lower bounds for this broad family of programs. Based on the 𝑋 -Partitioning abstraction, our method explicitly captures inter-statement dependencies. Applying our analysis to LU factorization, we derive COnf LUX, an LU algorithm with the parallel I/O cost of 𝑁 3 /(𝑃 √ 𝑀) communicated elements per p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 39 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?