2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI) 2016
DOI: 10.1109/icacci.2016.7732032
|View full text |Cite
|
Sign up to set email alerts
|

Automatic program generation for heterogeneous architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 23 publications
0
4
0
Order By: Relevance
“…This tool provides hybrid-parallel numerical kernels, intelligent resource management, and truly heterogeneous parallelism for multicore CPUs, Nvidia GPUs, and the Intel Xeon Phi (CPU-GPU-PHI) However, the data partitioning is static and is based on single-number performances of devices. Deepika et al [31] present a tool called OpenCLGen. OpenCLGen is a web-based software to automate OpenCL program generation for Single Kernel on Single Device as well as Multiple Kernels on Multiple Devices.…”
Section: B Programming Models and Tools For Hpcmentioning
confidence: 99%
“…This tool provides hybrid-parallel numerical kernels, intelligent resource management, and truly heterogeneous parallelism for multicore CPUs, Nvidia GPUs, and the Intel Xeon Phi (CPU-GPU-PHI) However, the data partitioning is static and is based on single-number performances of devices. Deepika et al [31] present a tool called OpenCLGen. OpenCLGen is a web-based software to automate OpenCL program generation for Single Kernel on Single Device as well as Multiple Kernels on Multiple Devices.…”
Section: B Programming Models and Tools For Hpcmentioning
confidence: 99%
“…In this study, we propose a method that ensures interworking between an artificial intelligence system and a neural network processing system by supporting the interoperable neural network standard format established by Khronos . We developed a parser for neural networks and data based on NNEF to achieve a convenient configuration for the neural networks used in a cross‐platform.…”
Section: Introductionmentioning
confidence: 99%
“…The abstractions introduced by OpenCL have been proved that prevent the obtaining of the same efficiency as when using directly the vendor programming models, for several common situations [10]. As a result, many state-of-the-art heterogeneous frameworks and libraries of higher level of abstraction [9,13,8,18,3,17], that rely on OpenCL as execution layer, typically inherit some of these problems.…”
Section: Introductionmentioning
confidence: 99%