Proceedings of the 8th Workshop on General Purpose Processing Using GPUs 2015
DOI: 10.1145/2716282.2716284

Helium: a transparent inter-kernel optimizer for OpenCL

Abstract: State-of-the-art automatic optimization of OpenCL applications focuses on improving the performance of individual compute kernels. Programmers address opportunities for inter-kernel optimization in specific applications by ad-hoc hand tuning: manually fusing kernels together. However, the complexity of interactions between host and kernel code makes this approach weak or even unviable for applications involving more than a small number of kernel invocations or a highly dynamic control flow, leaving substantial…

Cited by 8 publications (5 citation statements)
References 18 publications
“…When some nodes experience a high volume of data, the buffer will fill up and packets may be dropped. Configuring the kernel parameter in the New Application Programming Interface (NAPI) can enhance kernel performance by increasing the level of optimization and selecting multivariate features such as kernel complex quantitative near-infrared (K-NIR), kernel support vector regression (k-SVR), or kernel partial least squares (K-PLS) to improve the accuracy of packet processing [23,24,25,26]. In order to hold and process packets quickly, these kernel performance enhancements pull a high value of packets from interfaces and bind them with obtainable CPU cycles, which limit packet speed and time and have no buffer memory.…”
Section: Technical Discussion (mentioning)
confidence: 99%
“…In [29], the authors proposed a framework combining OpenCL application auto-tuning and runtime resource management. The study in [30] presented a transparent OpenCL overlay called Helium, for inter- and intra-kernel optimisation. The studies mentioned above are not yet mature and to the best of our knowledge, state-of-the-art research remains at the stage that optimisations are highly dependent on the specific algorithm, architecture, and programming features.…”
Section: B. ADAS Applications on Heterogeneous Platforms (mentioning)
confidence: 99%
“…In [7], the authors proposed a framework combining OpenCL application auto-tuning and runtime resource management. Authors in [8] presented a transparent OpenCL overlay called Helium, for inter- and intra-kernel optimization.…”
Section: Related Work (mentioning)
confidence: 99%