Proceedings of the 50th Annual Design Automation Conference 2013
DOI: 10.1145/2463209.2488735
|View full text |Cite
|
Sign up to set email alerts
|

Workload and user experience-aware dynamic reliability management in multicore processors

Abstract: Reliability is a major concern for nanoscale CMOS circuits. Degradation phenomena such as Electromigration, Negative Bias Temperature Instability, Time Dependent Dielectric Breakdown worsen with transistor scaling. Dynamic Reliability Management (DRM) techniques reduce reliability loss at runtime by constraining operating points, but they face the challenge of reducing user experience degradation while meeting a lifetime target. In this work we propose a sensor based hierarchical controller for multicore proce… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
47
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 43 publications
(47 citation statements)
references
References 19 publications
0
47
0
Order By: Relevance
“…This is achieved by the technique in [10], which is a workload-aware DRM technique for multiprocessors based on a two-level controller. This technique monitors system reliability on a long time scale and adapts operating conditions to workload quality requirements on a short time scale.…”
Section: A Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…This is achieved by the technique in [10], which is a workload-aware DRM technique for multiprocessors based on a two-level controller. This technique monitors system reliability on a long time scale and adapts operating conditions to workload quality requirements on a short time scale.…”
Section: A Related Workmentioning
confidence: 99%
“…Dynamic Reliability Management (DRM) is a set of techniques trading off processor degradation and performance at runtime [10], [15], [18]. Reliability is periodically assessed, and processor operating conditions are controlled to limit the degradation source (i.e.…”
Section: Introductionmentioning
confidence: 99%
“…However, if the adopted policies do not take into account how they affect the system lifetime, the architecture reliability might be significantly affected. On the other hand, those solutions that focus only on lifetime improvement ( [11,15,14,18,6]) are typically characterized by high energy consumption. This work proposes an approach for designing systems with combined on-line lifetime/energy optimization by exploiting the architecture asymmetry.…”
Section: Introduction and Related Workmentioning
confidence: 99%
“…Proactive energy management involves predicting these dynamic workloads apriori to determine the most appropriate frequency for every phase such that performance constraint is satisfied while minimizing the energy consumption [1]. Studies have been conducted recently to use machine learning to determine the minimum frequency through continuous feedback from the hardware performance monitoring unit (PMU) [2]- [10]. These approaches suffer from the following limitations.…”
Section: Introductionmentioning
confidence: 99%
“…This classifier is queried at run-time for a given application to predict the workload, and select the frequency and thread packing such that performance is maximized under a given power cap. A workload aware approach is proposed in [10] based on control theoretic principles. Our proposed approach differs from these techniques by addressing the three limitations discussed in Section I.…”
Section: Introductionmentioning
confidence: 99%