Proceedings of the 3rd Workshop on Fault-Tolerance for HPC at Extreme Scale 2013
DOI: 10.1145/2465813.2465816
|View full text |Cite
|
Sign up to set email alerts
|

Neutron sensitivity and software hardening strategies for matrix multiplication and FFT on graphics processing units

Abstract: In this paper, we compare the radiation response of GPUs executing matrix multiplication and FFT algorithms. The provided experimental results demonstrate that for both algorithms, in the majority of cases, the output is affected by multiple errors. The architectural and code analysis highlight that multiple errors are caused by shared resources corruption or thread dependencies. The experimental data and analytical studies can be fruitfully employed to evaluate the expected error rate of GPUs in realistic app… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2013
2013
2015
2015

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 18 publications
(16 reference statements)
0
2
0
Order By: Relevance
“…In this model, an adaptive adversary can corrupt up to δ memory cells of a large unreliable memory at any time (even simultaneously) during the execution of an algorithm. Resilient algorithmic techniques have been designed for many problems, including sorting [6], selection [7], dynamic programming [8], dictionaries [9], priority queues [10], matrix multiplication and FFT [11], K-d and suffix trees [12,13]. Resilient algorithms have also been experimentally evaluated [14,11,15,16].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In this model, an adaptive adversary can corrupt up to δ memory cells of a large unreliable memory at any time (even simultaneously) during the execution of an algorithm. Resilient algorithmic techniques have been designed for many problems, including sorting [6], selection [7], dynamic programming [8], dictionaries [9], priority queues [10], matrix multiplication and FFT [11], K-d and suffix trees [12,13]. Resilient algorithms have also been experimentally evaluated [14,11,15,16].…”
Section: Introductionmentioning
confidence: 99%
“…Resilient algorithmic techniques have been designed for many problems, including sorting [6], selection [7], dynamic programming [8], dictionaries [9], priority queues [10], matrix multiplication and FFT [11], K-d and suffix trees [12,13]. Resilient algorithms have also been experimentally evaluated [14,11,15,16].…”
Section: Introductionmentioning
confidence: 99%