2017
DOI: 10.1016/j.microrel.2017.07.035
|View full text |Cite
|
Sign up to set email alerts
|

A low-level software-based fault tolerance approach to detect SEUs in GPUs' register files

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
10
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(11 citation statements)
references
References 3 publications
0
10
0
Order By: Relevance
“…Similarly, some authors targeted and evaluated the effect of faults in data-path units [33], including the register file [34], and pipeline registers [35]. Other work proposed graceful performance degradation strategies to face permanent faults in SM units by employing specially instrumented kernels and coding styles, thus distributing the tasks across the available SMs [32].…”
Section: Related Work In the Areamentioning
confidence: 99%
“…Similarly, some authors targeted and evaluated the effect of faults in data-path units [33], including the register file [34], and pipeline registers [35]. Other work proposed graceful performance degradation strategies to face permanent faults in SM units by employing specially instrumented kernels and coding styles, thus distributing the tasks across the available SMs [32].…”
Section: Related Work In the Areamentioning
confidence: 99%
“…The fault injection environment is based on the ModelSim framework, and the injection methodology we used is the same introduced in [5,20]. Further details regarding the descriptions and configurations of the used benchmarks can be found in [18].…”
Section: Fault Detection Capabilitiesmentioning
confidence: 99%
“…On the one hand, software DWC mechanisms exploit time redundancy by repeatedly executing instructions [5][6][7], functions [8], or application tasks [8][9][10]. At the end, results are compared to detect faults.…”
mentioning
confidence: 99%
“…Software-based approaches provide high detection rates at the cost of performance degradation. They insert additional instructions that must be executed by the processing system, therefore increasing execution runtime, and can be applied to any GPU architecture with an available program source-code [6]. Hardware-based approaches, on the other hand, can be applied with no performance degradation, as replicated hardware can be deployed in parallel with the original, and, as long as the critical path is not altered, the operating frequency can be maintained, but require access to GPU architecture description [7].…”
Section: Introductionmentioning
confidence: 99%