2022
DOI: 10.1109/tcad.2021.3082863
|View full text |Cite
|
Sign up to set email alerts
|

A Taxonomy of Modern GPGPU Programming Methods: On the Benefits of a Unified Specification

Abstract: Several Application Programming Interfaces (APIs) and frameworks have been proposed to simplify the development of General-Purpose GPU (GPGPU) applications. GPGPU application development typically involves specific customization for the target operating systems and hardware devices. The effort to port applications from one API to the other (or to develop multi-target applications) is complicated by the availability of a plethora of specifications, which in essence offers very similar underlying functionality. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…CUDA enables to dispatch kernels through a launch configuration, i.e. a grid specified by the programmer in which parallel GPU threads are logically organized [4] in blocks of threads. Moreover, it allows the programmer to express an added layer of parallelism through CUDA Streams and Events.…”
Section: Background and Related Workmentioning
confidence: 99%
“…CUDA enables to dispatch kernels through a launch configuration, i.e. a grid specified by the programmer in which parallel GPU threads are logically organized [4] in blocks of threads. Moreover, it allows the programmer to express an added layer of parallelism through CUDA Streams and Events.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Differently, we also ported both obstacle avoidance and the best path selection on the GPU. We use NVIDIA CUDA to exploit GPU parallelism in different ways: using parallel threads [22], exploiting Shared Memory and syncthread directive; overlapping computing and memory transfers using Streams and Events [23].…”
Section: Related Workmentioning
confidence: 99%