2012
DOI: 10.1002/cpe.2952
|View full text |Cite
|
Sign up to set email alerts
|

Graphics processing unit acceleration of the red/black SOR method

Abstract: SUMMARYThis work presents our strategy, applied optimizations and results in our effort to exploit the computational capabilities of graphics processing units (GPUs) under the CUDA environment in order to solve the Laplacian PDE. The parallelizable red/black successive over-relaxation (SOR) method was used. Additionally, a program for the CPU was developed as a performance reference. Various performance improvements were achieved by using optimization methods, which proved to provide significant speedup. Memor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
10
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 20 publications
(10 citation statements)
references
References 17 publications
0
10
0
Order By: Relevance
“…As already stated, the SOR method uses the values from the previous iteration and the values from the current iteration to compute the current point, similarly as the Gauss-Seidel method, see (16.2). The parallel GPU implementation of the SOR method is enabled using the Red-Black ordering [15]. Updated values of the black points, i.e., values of the current iteration, are used to compute the red points.…”
Section: Methodsmentioning
confidence: 99%
“…As already stated, the SOR method uses the values from the previous iteration and the values from the current iteration to compute the current point, similarly as the Gauss-Seidel method, see (16.2). The parallel GPU implementation of the SOR method is enabled using the Red-Black ordering [15]. Updated values of the black points, i.e., values of the current iteration, are used to compute the red points.…”
Section: Methodsmentioning
confidence: 99%
“…In , the authors present GPU and CPU implementations of the red/black successive over‐relaxation method, comparing them for a variety of problem sizes. Five GPU kernels are implemented, tuned, and compared.…”
mentioning
confidence: 99%
“…Other contributions that have been used in this thesis include an implementation of a redblack SOR stencil computation method [44,45] which has been utilized in the experiments and it poses as a proof of concept case study in this thesis. A theoretical performance analysis of the algorithm was provided and the implementations included various kernels, each utilizing a different memory caching approach.…”
Section: This Thesis Primary Contributionsmentioning
confidence: 99%
“…The method is E. Konstantinidis described in detail in the next chapter (5.1.1) accompanied with a wide range of experimental results. Furthermore, related work has also been published by the author of this thesis [44,45] and can be referred for more information. After running 4 iterations of the computation on the GTX-480, the accumulated metrics were captured with the nvprof tool for all iterations of the red elements' computation as shown in table 4.4.…”
Section: Case Study 1: Red/black Sor Stencil Computationmentioning
confidence: 99%
See 1 more Smart Citation