2018
DOI: 10.1007/978-3-319-69953-0_7

MACC: An OpenACC Transpiler for Automatic Multi-GPU Use

Abstract: Graphics Processing Units (GPUs) perform the majority of computations in state-of-the-art supercomputers. Programming these GPUs is often assisted using a programming model such as (amongst others) the directive-driven OpenACC. Unfortunately, OpenACC (and other similar models) are incapable of automatically targeting and distributing work across several GPUs, which decreases productivity and forces needless manual labor upon programmers. We propose a method that enables OpenACC applications to target…

Cited by 11 publications (20 citation statements)
References: 13 publications
“…Note that our idea may be inferior in terms of productivity compared to [12]. However, the purpose of [12] is to execute a code on multiple accelerators environment without changing code on a single accelerator environment, so the purpose is different from ours. In addition, unlike [10,26], our method does not change the host side programming.…”
Section: Discussion
Citation type: mentioning
confidence: 96%
“…Also, unlike [23], the proposed directives can upload data at an arbitrary position. Moreover, unlike [12] proposed directives can utilize all accelerator memory effectively. In addition, while [23] and [12] may insert communication implicitly, our proposed OpenMP extension makes users insert communication explicitly using simple directives to perform performance tuning easily.…”
Section: Discussion
Citation type: mentioning
confidence: 99%