2013 IEEE International Conference on Cluster Computing (CLUSTER) 2013
DOI: 10.1109/cluster.2013.6702612
|View full text |Cite
|
Sign up to set email alerts
|

GPU-accelerated scalable solver for banded linear systems

Abstract: Solving a banded linear system efficiently is im portant to many scientific and engineering applications. Current solvers achieve good scalability only on the linear systems that can be partitioned into independent subsystems. In this paper, we present a GPU based, scalable Bi-Conj ugate Gradient Stabilized solver that can be used to solve a wide range of banded linear systems. We utilize a row-oriented matrix decomposition method to divide the banded linear system into several correlated sub linear systems an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 35 publications
0
1
0
Order By: Relevance
“…parameters are time-independent (i.e., we neglect corona effect that results in the time-dependent dynamic capacitance matrix C dyn (t)), the matrix A is fixed and needs to be computed only once. Moreover, on each voltage/current node, two of the three blocks are identity (unitary) matrices I. Consequently, the linear matrix system can be efficiently solved in band-storage form, using a parallelized or GPU CUDA solver whose performance scales with bandwidth (related to the number of wires N w ) and not with matrix dimension [50].…”
Section: Crank-nicolson Schemementioning
confidence: 99%
“…parameters are time-independent (i.e., we neglect corona effect that results in the time-dependent dynamic capacitance matrix C dyn (t)), the matrix A is fixed and needs to be computed only once. Moreover, on each voltage/current node, two of the three blocks are identity (unitary) matrices I. Consequently, the linear matrix system can be efficiently solved in band-storage form, using a parallelized or GPU CUDA solver whose performance scales with bandwidth (related to the number of wires N w ) and not with matrix dimension [50].…”
Section: Crank-nicolson Schemementioning
confidence: 99%