AMG based on compatible weighted matching for GPUs

Bernaschi, Massimo; D'Ambra, Pasqua; Pasquini, Dario

doi:10.1016/j.parco.2019.102599

Cited by 12 publications

(9 citation statements)

References 26 publications


“…The method, named coarsening based on compatible weighted matching was first introduced in [22] and is already available in the sequential package described in [21]. A first parallel version of the method, exploiting fine-grained parallelism and specifically tailored for single GPU device is described in [7,8]. The method is independent of any heuristics or a priori information on the near kernel of A, i.e., the lower part of the range of eigenvalues of the system matrix A which is generally used to obtain good-quality aggregates, and it is a completely automatic procedure applicable to general s.p.d.…”

Section: Parallel Aggregation Based On Weighted Graph Matching (mentioning)

confidence: 99%

AMG Preconditioners for Linear Solvers towards Extreme Scale

D'Ambra, Durastante, Filippone

2021

SIAM J. Sci. Comput.

Self Cite


Linear solvers for large and sparse systems are a key element of scientific applications, and their efficient implementation is necessary to harness the computational power of current computers. Algebraic MultiGrid (AMG) preconditioners are a popular ingredient of such linear solvers; this is the motivation for the present work where we examine some recent developments in a package of AMG preconditioners to improve efficiency, scalability and robustness on extreme scale problems. The main novelty is the design and implementation of a parallel coarsening algorithm based on aggregation of unknowns employing weighted graph matching techniques; this is a completely automated procedure, requiring no information from the user, and applicable to general symmetric positive definite (s.p.d.) matrices. The new coarsening algorithm improves in terms of numerical scalability at low operator complexity over decoupled aggregation algorithms available in previous releases of the package. The preconditioners package is built on the parallel software framework PSBLAS, which has also been updated to progress towards exascale. We present weak scalability results on one of the most powerful supercomputers in Europe for linear systems with sizes up to O(10^10) unknowns.
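To make the idea of aggregating unknowns by weighted graph matching concrete, here is a minimal Python sketch. It is not the authors' algorithm or the package's API: it uses a simple sequential greedy matching, takes |a_ij| as a placeholder edge weight (an assumption, since the abstract does not state the weights), and the function names `greedy_matching_aggregates` and `galerkin_coarse_matrix` are hypothetical.

```python
def greedy_matching_aggregates(n, edges):
    """Pair up vertices by a greedy weighted matching.

    n     -- number of unknowns (graph vertices)
    edges -- dict {(i, j): weight} with i < j; the weight choice is an
             assumption of this sketch, not the paper's definition
    Returns (agg, nc): agg[v] is the aggregate index of vertex v,
    nc is the number of aggregates.
    """
    agg = [-1] * n
    nc = 0
    # Visit the heaviest edges first; ties are broken by vertex indices.
    for (i, j), _w in sorted(edges.items(), key=lambda e: (-e[1], e[0])):
        if agg[i] < 0 and agg[j] < 0:
            agg[i] = agg[j] = nc  # matched pair becomes one aggregate
            nc += 1
    # Unmatched vertices become singleton aggregates.
    for v in range(n):
        if agg[v] < 0:
            agg[v] = nc
            nc += 1
    return agg, nc


def galerkin_coarse_matrix(A, agg, nc):
    """Form P^T A P for the piecewise-constant prolongator P defined by agg.

    A is a dense list of lists here purely for readability.
    """
    n = len(A)
    Ac = [[0.0] * nc for _ in range(nc)]
    for i in range(n):
        for j in range(n):
            Ac[agg[i]][agg[j]] += A[i][j]
    return Ac


if __name__ == "__main__":
    # 1D Laplacian on 4 points: tridiag(-1, 2, -1).
    A = [[2.0, -1.0, 0.0, 0.0],
         [-1.0, 2.0, -1.0, 0.0],
         [0.0, -1.0, 2.0, -1.0],
         [0.0, 0.0, -1.0, 2.0]]
    # Placeholder weights: |a_ij| on the off-diagonal nonzeros.
    edges = {(i, i + 1): abs(A[i][i + 1]) for i in range(3)}
    agg, nc = greedy_matching_aggregates(4, edges)
    print(agg, nc)
    print(galerkin_coarse_matrix(A, agg, nc))
```

On this small example the matching pairs unknowns (0, 1) and (2, 3), so the four unknowns collapse to two aggregates and the coarse matrix is again a tridiagonal Laplacian-like operator. A parallel implementation would replace the sorted greedy loop with a parallel approximate matching, which this sketch does not attempt.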



“…The new power-to-solution metrics requires a rethinking of many computational kernels of HPC applications looking for a trade-off between the reduction of the total energy and the minimization of the time-to-solution, promoting scalability. Within this context, extensions and improvements of high-performance algorithms and SW libraries for kernels in numerical linear algebra [44], [45] and graph computation, such as iterative [46], [47], [48], [49] and direct linear solvers, edge weighted graph matching, and fast multipole methods [50] will be deployed.…”

Section: Mathlib (mentioning)

confidence: 99%

TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale

Agosta, Cattaneo, Fornaciari et al.

2021

2021 24th Euromicro Conference on Digital System Design (DSD)

Self Cite


“…Its parallelization does not present particular difficulties as mainly relies on standard sparse linear algebra operations such as sparse matrix by matrix and matrix by vector products or other basic tasks which are nowadays readily handled by the highly optimized Basic Linear Algebra Operations (BLAS) routines [5]. The only two tasks that present some difficulties from the parallelization viewpoint are the smoother set-up and application [2] and the coarsening stage [4,3], which have been deeply investigated by several authors in recent years.…”

Section: Special Features Available In Chronos To Increase Performance (mentioning)

confidence: 99%

A General-Purpose AMG Linear Solver for High Performance Computing

Isotton, Frigo, Spiezia et al.

2021

14th WCCM-ECCOMAS Congress


The numerical simulation of modern engineering problems via finite elements requires the solution of sparse linear systems of millions or even billions of unknowns. The algebraic multigrid (AMG) methods are the most common choice as linear solvers because of their fast convergence even for large-size problems. In this communication, we propose Chronos, a massively parallel implementation of a novel AMG framework, specifically designed to address complex problems by adapting its components, from the smoother, to the coarse grid correction and prolongation to the problem at hand. This work demonstrates not only the numerical performance of the proposed library, but also its robustness and adaptability to very challenging matrices, arising from different fields of application.

