2010
DOI: 10.1016/j.parco.2009.12.003

GPU computing with Kaczmarz’s and other iterative algorithms for linear systems

Abstract: The graphics processing unit (GPU) is used to solve large linear systems derived from partial differential equations. The differential equations studied are strongly convection-dominated, of various sizes, and common to many fields, including computational fluid dynamics, heat transfer, and structural mechanics. The paper presents comparisons between GPU and CPU implementations of several well-known iterative methods, including Kaczmarz's, Cimmino's, component averaging, conjugate gradient normal residual (CGN…

Cited by 63 publications (35 citation statements)
References 19 publications

“…These iterative algorithms are distributed by averaging the boundary information, e.g., component averaging (CAV) (Censor et al, 2001) and component averaged row projections (CARP) (Gordon and Gordon, 2005). A survey paper comparing various block parallel methods based on their performance on GPU's is (Elble et al, 2010). Kamath et al (2013) proposed an algorithm called component average distributed multi-resolution evolving tomography (CA-DMET) which involved modification of component average type algorithms such as CAV and CARP for seismic tomography.…”
Section: Related Work (mentioning)
confidence: 99%
“…However, its shared memory implementation requires the duplication of certain nodes in the interior of the MPI partition. Such an approach was pursued in [44] for GPUs, but with moderate success. We will show in Section 5 that the increase in "averaged" vector entries leads to a significant memory overhead in weak and strong scaling experiments with CARP, so that fewer nodes can be used compared to the hybrid approach.…”
Section: Hybrid-parallel Approach (mentioning)
confidence: 99%
“…Its main advantages are robustness and cyclic convergence on inconsistent systems. The block parallel versions of ART have been studied and the comparison of various methods based on their performance on graphics processing units is discussed in Elble, Sahinidis, and Vouzis (2010). Among them, Cimmino's method (Cimmino, 1938) was the first one to introduce a parallel version of ART that guaranteed convergence even in case of inconsistent-system.…”
Section: Distributed Tomography Algorithms (mentioning)
confidence: 99%
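
The contrast drawn in the statement above, between sequential ART (Kaczmarz's method) and a Cimmino-style parallel variant, can be illustrated with a minimal sketch. This is not the cited paper's GPU implementation: the function names, the equal 1/m averaging weights, the unit relaxation parameter, and the 2×2 test system are all assumptions chosen for illustration, and the Cimmino-style sweep shown is a simplified simultaneous-projection form rather than Cimmino's original formulation.

```python
# Illustrative sketch only; names, weights, and the test system are assumptions.

def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def kaczmarz_sweep(A, b, x):
    """One sequential sweep: project the current iterate onto each
    row's hyperplane in turn, so every row sees the latest x."""
    x = list(x)
    for row, b_i in zip(A, b):
        scale = (b_i - dot(row, x)) / dot(row, row)
        x = [x_j + scale * a_j for x_j, a_j in zip(x, row)]
    return x


def cimmino_sweep(A, b, x):
    """One simultaneous sweep: compute every row's projection step from
    the same x, then apply their equal-weight average. The per-row steps
    are independent, which is what makes this form easy to parallelize."""
    m = len(A)
    update = [0.0] * len(x)
    for row, b_i in zip(A, b):
        scale = (b_i - dot(row, x)) / dot(row, row)
        update = [u_j + scale * a_j for u_j, a_j in zip(update, row)]
    return [x_j + u_j / m for x_j, u_j in zip(x, update)]


def solve(sweep, A, b, sweeps=200):
    """Apply a sweep function repeatedly, starting from the zero vector."""
    x = [0.0] * len(A[0])
    for _ in range(sweeps):
        x = sweep(A, b, x)
    return x


# Small consistent system whose exact solution is x = (2, 1).
A = [[4.0, 1.0], [1.0, 3.0]]
b = [9.0, 5.0]

x_kaczmarz = solve(kaczmarz_sweep, A, b)
x_cimmino = solve(cimmino_sweep, A, b)
```

On this small consistent system both iterations approach the same solution. The sequential sweep typically needs fewer sweeps, while the simultaneous sweep trades convergence speed for row-level independence.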