2013 IEEE 19th International On-Line Testing Symposium (IOLTS), 2013
DOI: 10.1109/iolts.2013.6604088

Increasing the robustness of CUDA Fermi GPU-based systems

Abstract: Graphics Processing Units (GPUs) have become increasingly popular due to their high computational power and low prices. This makes them particularly suitable for high-performance computing applications, such as data elaboration and image processing. In these fields, the ability to work properly even in the presence of faults is mandatory. This paper presents an innovative approach that combines a Software Based Self Test & Diagnosis (SBSTD) methodology with a fault mitigation strategy to increase the …

Cited by 7 publications
(3 citation statements)
References 10 publications
“…Moreover, the same method was partially applied to functionally test the special function units (SFUs) in the GPGPUs. Other works introduced application robustness [31] and mitigation strategies [32] for the data-path modules. Those methods employed combinations of high-level languages and in-line assembly code again.…”
Section: Related Work in the Area (mentioning)
confidence: 99%
“…Program duplication was employed in the past by duplicating the execution of the entire program [11]. Similarly, the authors in [7] targeted faults affecting the general-purpose register files in the GPU by replicating the whole assembly code in an intertwined fashion and reaching up to 99% error reduction at a performance cost of up to 78%.…”
Section: A Software-based Hardening Techniques for GPUs (mentioning)
confidence: 99%
“…Other works introduced functional tests [17,18], fault detection [19][20][21][22], and mitigation [23][24][25] strategies only based on software mechanisms. These solutions are effective in detecting most faults and tolerating a high percentage of them.…”
Section: Introduction (mentioning)
confidence: 99%