SPAI Preconditioners for HPC Applications

Sawyer, William; Vanini, Carlo; Fourestey, Gilles; Popescu, Radu

doi:10.1002/pamm.201210314

Cited by 3 publications

(1 citation statement)

References 4 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A substantial amount of research has been conducted on various preconditioning techniques for iterative solvers on GPUs including algebraic multigrid [Bell et al 2012;Gandham et al 2014;Richter et al 2014;Wagner et al 2012], incomplete factorizations [Li and Saad 2013;Naumov 2012], or sparse approximate inverses [Dehnavi et al 2013;Lukash et al 2012;Sawyer et al 2012]. Nevertheless, hardware-efficient and scalable black-box preconditioners for GPUs are not available, but instead the use of problem-specific information is required [Yokota et al 2011].…”

Section: Introductionmentioning

confidence: 99%

Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units

Rupp

Weinbub

Jüngel

2016

ACM Trans. Math. Softw.

View full text Add to dashboard Cite

We revisit the implementation of iterative solvers on discrete graphics processing units and demonstrate the benefit of implementations using extensive kernel fusion for pipelined formulations over conventional implementations of classical formulations. The proposed implementations with both CUDA and OpenCL are freely available in ViennaCL and are shown to be competitive with or even superior to other solver packages for graphics processing units. Highest performance gains are obtained for small to medium-sized systems, while our implementations are on par with vendor-tuned implementations for very large systems. Our results are especially beneficial for transient problems, where many small to medium-sized systems instead of a single big system need to be solved.

show abstract