Proceedings of the 28th International Conference on Compiler Construction 2019
DOI: 10.1145/3302516.3307350

PPOpenCL: a performance-portable OpenCL compiler with host and kernel thread code fusion

Cited by 3 publications (8 citation statements)
References 47 publications

“…So there are multiple attempts to improve cross‐platform performance portability for OpenCL platforms. One of them is a source‐to‐source OpenCL compiler, PPOpenCL [5]. It was implemented in Clang and is based on fusing the host and kernel thread codes of an OpenCL program.…”
Section: Related Work (mentioning)
confidence: 99%
“…The main ideas of OpenCL are expressed based on the following hierarchy of models: (i) platform model, (ii) execution model, (iii) memory, and (iv) programming models. In particular, OpenCL provides a platform‐independent abstract platform model that allows arranging computations and data access [5]. This model is based on the host‐centric view, where the platform consists of a host connected to one or more OpenCL compute devices (e.g., GPUs).…”
Section: Heterogeneous Computing Systems and Programming Models (mentioning)
confidence: 99%
“…The work-items are organized into work-groups with each work-group running on one CU, which is mapped to one CPE in SWCL. Thus the work-items are executed on the CPE serially, which is referred to as serial execution mode in [41]. Work-groups are statically assigned to CPEs using block distribution, by introducing an explicit loop nest, i.e., a work-group loop for each CPE, and work-group barriers are thus supported by loop fission, as in POCL [27], MOCL [72] and SNU-OCL [32].…”
Section: Basic OpenCL Implementation on SW26010 (mentioning)
confidence: 99%
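
The last statement above describes serial execution of work-items, block distribution of work-groups, and barrier support through loop fission. The C sketch below illustrates that general idea only; it is not taken from SWCL, POCL, MOCL, SNU-OCL, or PPOpenCL. It assumes a hypothetical kernel that copies each element into local memory, hits a work-group barrier, and then writes the elements back in reversed order within the group. The names `WG_SIZE`, `run_work_group`, `run_on_cpe`, and `run_ndrange` are all made up for illustration, and the CPEs are simulated by an ordinary sequential loop.

```c
#include <assert.h>
#include <stddef.h>

enum { WG_SIZE = 4 };  /* hypothetical work-group size */

/* One work-group executed serially by a single core.
 * The assumed kernel body is:
 *     tmp[lid] = data[gid];
 *     barrier(CLK_LOCAL_MEM_FENCE);
 *     data[gid] = tmp[WG_SIZE - 1 - lid];
 * Loop fission splits the work-item loop at the barrier, so every
 * work-item finishes the first half before any starts the second. */
void run_work_group(int *data, size_t group_id)
{
    int tmp[WG_SIZE];                    /* stands in for __local memory */
    size_t base = group_id * WG_SIZE;

    /* Work-item loop for the code before the barrier. */
    for (size_t lid = 0; lid < WG_SIZE; ++lid)
        tmp[lid] = data[base + lid];

    /* The barrier sat here; the loop boundary now enforces it. */

    /* Work-item loop for the code after the barrier. */
    for (size_t lid = 0; lid < WG_SIZE; ++lid)
        data[base + lid] = tmp[WG_SIZE - 1 - lid];
}

/* Work-group loop for one core: block distribution hands each core a
 * contiguous range of work-group IDs. */
void run_on_cpe(int *data, size_t num_groups, size_t cpe, size_t num_cpes)
{
    assert(num_cpes > 0 && cpe < num_cpes);
    size_t chunk = (num_groups + num_cpes - 1) / num_cpes;  /* ceiling */
    size_t begin = cpe * chunk;
    size_t end = begin + chunk;
    if (begin > num_groups) begin = num_groups;
    if (end > num_groups) end = num_groups;

    for (size_t g = begin; g < end; ++g)
        run_work_group(data, g);
}

/* Stand-in for launching all cores; here they simply run one after
 * another on the host. */
void run_ndrange(int *data, size_t num_groups, size_t num_cpes)
{
    for (size_t cpe = 0; cpe < num_cpes; ++cpe)
        run_on_cpe(data, num_groups, cpe, num_cpes);
}
```

Calling `run_ndrange(data, 3, 2)` on an array of 12 integers would give work-groups 0 and 1 to the first simulated core and work-group 2 to the second, reversing each block of four elements in place.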