2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, 2022
DOI: 10.1109/pmbs56514.2022.00013

Benchmarking Fortran DO CONCURRENT on CPUs and GPUs Using BabelStream

Abstract: Fortran DO CONCURRENT has emerged as a new way to achieve parallel execution of loops on CPUs and GPUs. This paper studies the performance portability of this construct on a range of processors and compares it with the incumbent models: OpenMP, OpenACC and CUDA. To do this study fairly, we implemented the BabelStream memory bandwidth benchmark from scratch, entirely in modern Fortran, for all of the models considered, which include Fortran DO CONCURRENT, as well as two variants of OpenACC, four variants of Ope…

Cited by 6 publications (2 citation statements)
References 18 publications
“…Due to the novelty of DC support for GPUs, there are only a few examples of its use in the literature (such as a spherical surface diffusion tool called DIFFUSE [9], and an implementation of the BabelStream benchmark [10]). There are additional works in progress, including a chemistry mini-app called CCSD(T)⁴, a CFD weather mini-app called MiniWeather⁵, a hydrodynamics mini-app called CloverLeaf⁶, and a conjugate gradient solver used in Solar physics called POT3D⁷.…”
Section: Related Work (mentioning)
confidence: 99%
“…From our previous experience in porting OpenACC to DC [7], we learned that using manual data management yields better performance than relying on unified managed memory (UM) capabilities. Unified managed memory is an NVIDIA feature that will automatically page data to and from the GPU and CPU for GPU-accelerated codes¹⁰. This eases the burden of the programmer by not requiring manual management of the data movement.…”
Section: A Code 1 [A]: Current OpenACC Implementation (mentioning)
confidence: 99%