2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC) 2018
DOI: 10.1109/dac.2018.8465797
|View full text |Cite
|
Sign up to set email alerts
|

Approximation-Aware Coordinated Power/Performance Management for Heterogeneous Multi-cores

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
1
1

Relationship

3
2

Authors

Journals

citations
Cited by 8 publications
(13 citation statements)
references
References 7 publications
0
13
0
Order By: Relevance
“…Table 9.1 Average, standard deviation, and overall contribution to the overall latency of each actor of the object detection application in Fig. 9.6 when processing a test sequence of 9 149 images and executed each in isolation on a single core running constantly at maximum frequency according to [3,10,21,26] to enforce the global latency upper bound UB L = 115 ms for the given application. Due to the small variation and overall latency contribution of all except the actors SD and SM according to Table 9.1, we dedicate a time budget of 20 ms to the other actors altogether, assuming that their cumulative latency per input image does not exceed this budget.…”
Section: Enforcement Problem Descriptionmentioning
confidence: 99%
“…Table 9.1 Average, standard deviation, and overall contribution to the overall latency of each actor of the object detection application in Fig. 9.6 when processing a test sequence of 9 149 images and executed each in isolation on a single core running constantly at maximum frequency according to [3,10,21,26] to enforce the global latency upper bound UB L = 115 ms for the given application. Due to the small variation and overall latency contribution of all except the actors SD and SM according to Table 9.1, we dedicate a time budget of 20 ms to the other actors altogether, assuming that their cumulative latency per input image does not exceed this budget.…”
Section: Enforcement Problem Descriptionmentioning
confidence: 99%
“…Performance models have been developed to estimate throughput due to configuration variations only on the CPU. Throughput (th j ) of an application j (or in general its performance) running on a single cluster i (either big or LITTLE) has an almost linear relationship with the CPU quota Q j assigned to the application and cluster's frequency level f i , whereas there is a sublinear relationship with the parallelization level #t (in number of assigned CPU cores), as shown in [4], [13]. Furthermore, as in [3], it is reasonable to estimate an average performance ratio between big and LITTLE clusters at the same baseline frequency, namely r b→L .…”
Section: A Power and Performance Modelsmentioning
confidence: 99%
“…Runtime Resource Controller. We implemented a controller similar to the one proposed in [13]. It consists of a process running in user-space, capable of 1) accessing all HW sensors and knobs through Linux interface, 2) controlling process allocation by means of the cgroups mechanism, and 3) measuring applications' performance in terms of throughput with the HeartBeat mechanism.…”
Section: Introductionmentioning
confidence: 99%
“…This can be extended in the context of both multi-core and many-core systems running emerging workloads from media processing domains and can be exploited opportunistically subject to application requirements (El-Harouni et al, 2017;Palomino et al, 2016). Particularly with battery operated mobile processors and resource constrained systems, approximation can be used for efficient resource allocation (Kanduri et al, 2018b).…”
Section: Accuracy Bound Qos Managementmentioning
confidence: 99%