2013
DOI: 10.1007/978-3-642-35473-1_62
|View full text |Cite
|
Sign up to set email alerts
|

A Translation Framework for Automatic Translation of Annotated LLVM IR into OpenCL Kernel Function

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 1 publication
0
3
0
Order By: Relevance
“…Thus, we allocate contiguous chunks of size N c × N c to hold a full sub-quadtree together with a N b × N b blocking at the lowest level. The chunks are processed using OpenMP and the code can potentially be used without modification on the Intel Xeon Phi coprocessor and through automatic source code translation [114,102,121,57] on GPGPUs. In addition, the use of OpenMP removes Charm++ compile and runtime dependencies for single-node applications, potentially significantly simplifying the build process.…”
Section: Fig 21mentioning
confidence: 99%
“…Thus, we allocate contiguous chunks of size N c × N c to hold a full sub-quadtree together with a N b × N b blocking at the lowest level. The chunks are processed using OpenMP and the code can potentially be used without modification on the Intel Xeon Phi coprocessor and through automatic source code translation [114,102,121,57] on GPGPUs. In addition, the use of OpenMP removes Charm++ compile and runtime dependencies for single-node applications, potentially significantly simplifying the build process.…”
Section: Fig 21mentioning
confidence: 99%
“…The compiler front-end [5] involves lexical analyzing AST parsing, syntax analyzing, address qualifier parsing, vector parsing, CGIR expansion, and WHIRL lowering optimization passes. The compile process of character stream is shown in Figure 9.…”
Section: Compilermentioning
confidence: 99%
“…It is a challenging task to support OpenCL program model on multicore DSP for embedded application. We address this problem by firstly utilizing the LLVM (low level virtual machine) [5] and Clang [6] open source compiler to support kernel compilation and further optimization for the DSP platform; then we designed 2 Advances in Mechanical Engineering [7,8] scheduler that aimed to schedule work-item in a work group to decrease the task switching overhead. Finally, we proposed a kind of software managed CACHE method to efficiently administrate the distributed global memory which was combined through interconnections such as PCIE, SRIO (serial rapid IO), Hyperlink, and SGMII.…”
Section: Introductionmentioning
confidence: 99%