2012 Ninth International Conference on Quantitative Evaluation of Systems 2012
DOI: 10.1109/qest.2012.37
|View full text |Cite
|
Sign up to set email alerts
|

Intermittent Hardware Errors Recovery: Modeling and Evaluation

Abstract: Abstract-The frequency of hardware errors is increasing due to shrinking feature sizes, higher levels of integration, and increasing design complexity. Intermittent errors are those that occur non-deterministically at the same location. It has been shown that intermittent hardware errors contribute to about 39% of the total hardware failures. Intermittent faults have characteristics that are different than transient and permanent errors, which makes it challenging to devise efficient recovery techniques for th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
36
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 21 publications
(36 citation statements)
references
References 28 publications
0
36
0
Order By: Relevance
“…The most important finding of this paper is that the largest source of failure in such data centers is disk failure. Intermittent hardware errors are evaluated in [37].…”
Section: Cloud Computing Reliabilitymentioning
confidence: 99%
See 1 more Smart Citation
“…The most important finding of this paper is that the largest source of failure in such data centers is disk failure. Intermittent hardware errors are evaluated in [37].…”
Section: Cloud Computing Reliabilitymentioning
confidence: 99%
“…An AFR represents the estimated probability that a device will fail during a full year of use. In this study, all AFR values are derived from the work found in [36][37][38][39][40][41][42].…”
Section: Samplingmentioning
confidence: 99%
“…This model can be used only for systems with periodic testing and continuous operation. The impact of various scenarios for restoring the processor after the occurrence of intermittent faults on its performance was assessed in [25]. To achieve this goal, the operation of a fault-tolerant multi-core processor is simulated in the presence of intermittent faults, subject to exponential and Weibull distribution.…”
Section: Introductionmentioning
confidence: 99%
“…their impact in circuits and systems, accurate models are required. With the purpose of providing such models, several works have been studying fault models for dependability analysis [66,136]. In them, it is made clear how important models are to be able to study faults and their effects.…”
Section: Modelingmentioning
confidence: 99%
“…Its drawbacks are it requires 90 B.3 Solutions for detection and diagnosis a long latency to discriminate, and infrastructure to detect and accumulate the respective faults. Other recent studies which also employ SAN with thresholds [136] applied to real systems only consider intermittent errors captured in state variables, which last more than one clock cycle.…”
Section: B23 Fault Diagnosismentioning
confidence: 99%