2016
DOI: 10.3390/computation4030029
|View full text |Cite
|
Sign up to set email alerts
|

DiamondTorre Algorithm for High-Performance Wave Modeling

Abstract: Effective algorithms of physical media numerical modeling problems' solution are discussed. The computation rate of such problems is limited by memory bandwidth if implemented with traditional algorithms. The numerical solution of the wave equation is considered. A finite difference scheme with a cross stencil and a high order of approximation is used. The DiamondTorre algorithm is constructed, with regard to the specifics of the GPGPU's (general purpose graphical processing unit) memory hierarchy and parallel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2018
2018
2019
2019

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 21 publications
(8 citation statements)
references
References 15 publications
0
8
0
Order By: Relevance
“…• DiamondTorre algorithm [4] implemented for CPU with the use of SIMD (instead of CUDA threads) and OpenMP (instead of CUDA blocks).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…• DiamondTorre algorithm [4] implemented for CPU with the use of SIMD (instead of CUDA threads) and OpenMP (instead of CUDA blocks).…”
Section: Resultsmentioning
confidence: 99%
“…The shape in the time-space can be thought of as a code block for computation of the points inside it. Then an LRnLA algorithm is defined as a shape with its subdivision rule [4].…”
Section: Time-space Decompositionmentioning
confidence: 99%
See 2 more Smart Citations
“…On the contrary, quite a few tools, provided to the developer in modern C++ language, significantly speed up effective implementation of complex computational algorithms. In particular, LRnLA (locally recursive non-local-asynchronous) algorithms, which provide extremely high performance in problems of numerical simulation [6,7,8,9], were implemented on combination of C++ and Python languages with heavy use of the C++ templates mechanisms.…”
Section: Introductionmentioning
confidence: 99%