2022
DOI: 10.1007/978-3-030-97759-7_1
|View full text |Cite
|
Sign up to set email alerts
|

Can Fortran’s ‘do concurrent’ Replace Directives for Accelerated Computing?

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 6 publications
0
8
0
Order By: Relevance
“…Due to the novelty of DC support for GPUs, there are only a few examples of its use in the literature (such as a spherical surface diffusion tool called DIFFUSE [9], and an implementation of the BabelStream benchmark [10]). There are additional works in progress, including a chemistry mini-app called CCSD(T) 4 , a CFD weather mini-app called MiniWeather 5 , a hydrodynamics mini-app called CloverLeaf 6 , and a conjugate gradient solver used in Solar physics called POT3D 7 .…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Due to the novelty of DC support for GPUs, there are only a few examples of its use in the literature (such as a spherical surface diffusion tool called DIFFUSE [9], and an implementation of the BabelStream benchmark [10]). There are additional works in progress, including a chemistry mini-app called CCSD(T) 4 , a CFD weather mini-app called MiniWeather 5 , a hydrodynamics mini-app called CloverLeaf 6 , and a conjugate gradient solver used in Solar physics called POT3D 7 .…”
Section: Related Workmentioning
confidence: 99%
“…It can run simulations containing over three hundred million grid cells [23], [24] and exhibits performance scaling to thousands of CPU cores or dozens of GPUs [22], [25]. The MAS code uses a logically rectangular non-uniform staggered spherical grid and finite-difference discretizations with a combination of explicit and implicit time-stepping methods 9 . The code is highly memory-bound, with its performance typically proportional to the hardware's memory bandwidth.…”
Section: The Mas Solar Mhd Modelmentioning
confidence: 99%
See 2 more Smart Citations
“…We have implemented options both to use device memory in DoConcurrent and to use managed memory in the other models, to allow apples-to-apples and oranges-to-oranges comparisons, but results with those options are not included. There is value in evaluating idiomatic versus mixed-mode programming in the context of more complex algorithms and applications, as has been done in [43].…”
Section: Gpu Experimentsmentioning
confidence: 99%