2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing
DOI: 10.1109/pdp.2013.16

Performance Traps in OpenCL for CPUs

Cited by 14 publications (1 citation statement)
References: 10 publications
“…OpenCL programs are compiled just in time for execution and can be used together with Mi-AccLib or other run-time libraries. These works [16][17][18] experienced a performance penalty on the NVIDIA GPU, due to the OpenCL abstraction layer. Thus, we have disabled OpenCL support as it is not optimized for GPUs at the moment, and real gains on GPUs can only be seen through optimized code as there are additional overheads from data movement.…”
Section: Related Work (mentioning)
confidence: 99%