Numerically Stable Recurrence Relations for the Communication Hiding Pipelined Conjugate Gradient Method

Cools, Siegfried; Cornelis, Jeffrey; Vanroose, Wim

doi:10.48550/arxiv.1902.03100

Cited by 1 publication

(4 citation statements)

References 0 publications

“…The corresponding dot-products on line 23 are defined as M-inner products in the context of the preconditioned algorithm, see [8] for details.…”

Section: Technical Comments On the Algorithm (mentioning)

confidence: 99%

“…Numerical round-off errors may increase the occurrence of these breakdowns in practice, cf. [8,11]. When a breakdown occurs the p(l)-CG iteration is restarted explicitly.…”

Section: Technical Comments On the Algorithm (mentioning)

confidence: 99%

“…that are well established for the original Krylov algorithms. Research on analyzing and improving the numerical stability of pipelined Krylov subspace methods has recently been performed in [4,7,8,10]; we point out the references therein for more information on the numerical properties of Krylov subspace methods.…”

Section: Introduction (mentioning)

confidence: 99%

“…Strong scaling results obtained from an MPI-based implementation of the l-length pipelined Conjugate Gradient method, p(l)-CG for short, are presented in this work. The p(l)-CG method was recently presented in [8,11] and allows to overlap each global reduction phase with the communication and computational work of l subsequent iterations. This overlap is achieved by exploiting the asynchronous non-blocking global "Iallreduce" MPI operations that have been available from the MPI3 standard onward.…”

Section: Introduction (mentioning)

confidence: 99%

Improving strong scaling of the Conjugate Gradient method for solving large linear systems using global reduction pipelining

Cools, Cornelis, Ghysels et al. 2019

Preprint (self-citation)

Figure 1: Schematic representation of global reduction pipelining in Krylov subspace methods (e.g. Conjugate Gradients) for pipeline length two (l = 2). Global communication is initiated by an MPI_Iallreduce call. The reduction overlaps with the global communication and computational kernels in the next two iterations and is finalized by MPI_Wait. Optimally, a theoretical O(l) speedup over classic Krylov subspace methods is achieved.
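
The overlap described in the caption can be illustrated with a minimal sketch. This is not the authors' implementation and it does not use MPI: a thread-pool future stands in for the non-blocking MPI_Iallreduce call, and `Future.result()` stands in for MPI_Wait. The function name `pipelined_dot_products` and its parameters are hypothetical, chosen only for this illustration.

```python
from collections import deque
from concurrent.futures import ThreadPoolExecutor


def pipelined_dot_products(vectors, pipeline_length=2):
    """Return v.v for each vector, finalizing each reduction
    `pipeline_length` iterations after it was started."""
    results = []
    pending = deque()  # reductions started but not yet finalized
    with ThreadPoolExecutor(max_workers=1) as pool:
        for v in vectors:
            # Start the reduction without blocking
            # (assumed stand-in for MPI_Iallreduce).
            pending.append(pool.submit(lambda x=v: sum(a * a for a in x)))
            # The local work of this iteration would overlap here.
            if len(pending) > pipeline_length:
                # Finalize the reduction started `pipeline_length`
                # iterations ago (assumed stand-in for MPI_Wait).
                results.append(pending.popleft().result())
        # Drain the pipeline once the last iteration has been issued.
        while pending:
            results.append(pending.popleft().result())
    return results
```

With `pipeline_length=0` every reduction is finalized in the iteration that started it, which corresponds to the blocking behaviour of a classic Krylov iteration. With `pipeline_length=2` each result is collected two iterations later, as in the l = 2 case of the figure.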
