2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA) 2021
DOI: 10.1109/isca52012.2021.00075
|View full text |Cite
|
Sign up to set email alerts
|

Demystifying the System Vulnerability Stack: Transient Fault Effects Across the Layers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 42 publications
(11 citation statements)
references
References 67 publications
0
11
0
Order By: Relevance
“…However, evaluations on complete designs and large applications might involve unfeasible evaluation times. Other strategies, including software-based error propagation on GPUs, depend on the error models used, which might induce simplified evaluations and inaccurate analyses [28]. Thus, strategies to effectively characterize fault effects while providing an acceptable trade-off between performance and accuracy are still required.…”
Section: A Motivation and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…However, evaluations on complete designs and large applications might involve unfeasible evaluation times. Other strategies, including software-based error propagation on GPUs, depend on the error models used, which might induce simplified evaluations and inaccurate analyses [28]. Thus, strategies to effectively characterize fault effects while providing an acceptable trade-off between performance and accuracy are still required.…”
Section: A Motivation and Related Workmentioning
confidence: 99%
“…Low-level hardware fault simulations (e.g., gate level) offer detailed insights but are time-consuming, while higher-level hardware or software simulations (e.g., microarchitectural or software) provide faster but less accurate results. Additionally, selecting appropriate fault models for simulation is critical to prevent inaccurate conclusions [42]. In contrast, physical fault injection using radiation exposes the system to a beam of ionizing particles, offering a means to estimate realistic error rates.…”
Section: A Motivation and Related Workmentioning
confidence: 99%
“…Nevertheless, existing reliability optimization techniques for traditional edge computing can serve as a guide for improving the reliability in serverless edge computing scenarios. This is because both scenarios are susceptible to common failure types, including transient faults caused by electromagnetic interference and cosmic radiation [23] and bit errors during data communication [24]. In the literature, researchers have explored reliability-aware computation offloading and assignment [15], developed frameworks to minimize the block error probability during communication [16], devised strategies to tolerate transient faults and bit errors [17].…”
Section: Reliability Augmentationmentioning
confidence: 99%
“…Faults can be injected at different levels of abstraction, from low-level Register-Transfer Level (RTL) [32]- [34] to microarchitecture [35], [36] and software [19], [37]- [41]. Each level of abstraction provides the propagation probability of the injected fault to the output, measured in terms of Hardware, Architectural, Software, or Program Vulnerability Factor (HVF, AVF, SVF, PVF, respectively) [42], [43]. Fault injection assumes that the fault already occurred and provides the fault propagation through the system.…”
Section: Radiation Induced Errors In Dnnsmentioning
confidence: 99%