On the Parallel I/O Optimality of Linear Algebra Kernels: Near-Optimal LU Factorization
Grzegorz Kwasniewski,
Tal Ben-Nun,
Alexandros Nikolaos Ziogas
et al.
Abstract:Dense linear algebra kernels, such as linear solvers or tensor contractions, are fundamental components of many scientific computing applications. In this work we present a novel method of deriving parallel I/O lower bounds for this broad family of programs. Based on the π -Partitioning abstraction, our method explicitly captures inter-statement dependencies. Applying our analysis to LU factorization, we derive COnf LUX, an LU algorithm with the parallel I/O cost of π 3 /(π β π) communicated elements per p… Show more
Set email alert for when this publication receives citations?
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citationsβcitations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.