2021
DOI: 10.1109/tc.2020.2980520
|View full text |Cite
|
Sign up to set email alerts
|

HePREM: A Predictable Execution Model for GPU-based Heterogeneous SoCs

Abstract: The ever-increasing need for computational power in embedded devices has led to the adoption of heterogeneous SoCs combining a general purpose CPU with a data parallel accelerator. These systems rely on a shared main memory (DRAM), which makes them highly susceptible to memory interference. A promising software technique to counter such effects is the Predictable Execution Model (PREM). PREM ensures robustness to interference by separating programs into a sequence of memory and compute phases, and by enforcing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3
1

Relationship

3
6

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 34 publications
0
10
0
Order By: Relevance
“…The relevance of this observation becomes clear when we consider the recent efforts that the research community has put into making the adoption of modern HeSoCs feasible in the context of real-time applications [7,16,37]. The stricter the real-time requirements, the more modest the adoption of parallel systems has been so far in this domain.…”
Section: Predictable Execution and Memory Underutilisationmentioning
confidence: 99%
See 1 more Smart Citation
“…The relevance of this observation becomes clear when we consider the recent efforts that the research community has put into making the adoption of modern HeSoCs feasible in the context of real-time applications [7,16,37]. The stricter the real-time requirements, the more modest the adoption of parallel systems has been so far in this domain.…”
Section: Predictable Execution and Memory Underutilisationmentioning
confidence: 99%
“…By scheduling memory phases in a mutually exclusive manner memory contention is avoided. Originally formulated to address concurrent accesses between single-core CPU and devices with direct memory access [24], PREM has been later successfully extended to the case of multi-core CPUs [5,27] and of HeSoCs [15,16,20]. Although effective at guaranteeing predictable timing of memory accesses, PREM-like approaches greatly sacrifice memory bandwidth utilization, as bandwidth in a modern HeSoC is sized to concurrently serve multiple computing units.…”
Section: Introductionmentioning
confidence: 99%
“…The AutoDMA plugin is also able to perform loop tiling to extract segments of code whose memory footprint is small enough to fit in the local memory. The AutoDMA plugin is an extension of HePREM [28], originally envisioned for transforming real-time GPU code to be less sensitive to memory interference. This was achieved by transforming GPU kernels into a series of load, execute, and store phases, with explicit synchronization points between them.…”
Section: Ease Of Programming and Code Portabilitymentioning
confidence: 99%
“…Worst-case execution time analysis: In recent years, works on worst-case execution time (WCET) analysis for GPU programs has gained attention (Betts and Donaldson, 2013;Berezovskyi et al, 2014;Forsberg et al, 2020).…”
Section: Related Workmentioning
confidence: 99%