2019
DOI: 10.1007/978-3-030-12274-4_4
|View full text |Cite
|
Sign up to set email alerts
|

A Case Study for Performance Portability Using OpenMP 4.5

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3

Relationship

2
4

Authors

Journals

citations
Cited by 28 publications
(22 citation statements)
references
References 9 publications
0
21
1
Order By: Relevance
“…On CPUs, that support array reductions, we can perform a reduction on the real and imaginary array equivalents rather than the scalars. A detailed analysis of our methodology is presented in Reference 24 .…”
Section: Applications: Porting and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…On CPUs, that support array reductions, we can perform a reduction on the real and imaginary array equivalents rather than the scalars. A detailed analysis of our methodology is presented in Reference 24 .…”
Section: Applications: Porting and Resultsmentioning
confidence: 99%
“…The GPP and FF mini‐apps represent the General Plasmon Pole and Full Frequency self‐energy summations in BerkeleyGW. The GPP mini‐app already has documented OpenMP 4.5 and OpenACC ports 17 …”
Section: Introductionmentioning
confidence: 99%
“…Because of the early adoption of OpenMP directives, we were able to learn from the experiences of Vergara Larrea, et al [12] who used OpenMP 4.0 directives to port codes to NVIDIA GPUs. The challenges of using OpenMP 4.5 for performance portability has been documented in detail in work by Gayatri, et al [13] This study laid the groundwork for improving TestSNAP serial version using OpenMP. From this study, it was observed that the collapse clause would be better optimized using the column-major data storage format for 2D and higher dimensional arrays.…”
Section: Related Workmentioning
confidence: 89%
“…In addition, the latest versions of the Nvidia HPC Software Development Kit (SDK) provide new tools and libraries designed to maximize performance by optimizing memory transfers and scaling to multiple devices while targeting heterogeneous resources [52]. Additionally, various studies regarding vendor-agnostic offloading approaches show promising results based on standard APIs and/or opensource, non-proprietary solutions [53,54]. These would be very interesting to explore in future iterations.…”
Section: Discussionmentioning
confidence: 99%