Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis 2015
DOI: 10.1145/2807591.2807599
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting asynchrony from exact forward recovery for DUE in iterative solvers

Abstract: This paper presents a method to protect iterative solvers from Detected and Uncorrected Errors (DUE) relying on error detection techniques already available in commodity hardware. Detection operates at the memory page level, which enables the use of simple algorithmic redundancies to correct errors. Such redundancies would be inapplicable under coarse grain error detection, but become very powerful when the hardware is able to precisely detect errors.\ud Relations straightforwardly extracted from the solver al… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
22
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
3
1

Relationship

4
4

Authors

Journals

citations
Cited by 33 publications
(22 citation statements)
references
References 31 publications
0
22
0
Order By: Relevance
“…• This paper further extends the original conference manuscript [25] with an in-depth study of the effect of page sizes, from 4KB up to 2MB, on the overheads of the techniques. Our algorithmic methods outperform the state-of-the-art on average up to 512KB page sizes.…”
Section: Introductionmentioning
confidence: 65%
See 1 more Smart Citation
“…• This paper further extends the original conference manuscript [25] with an in-depth study of the effect of page sizes, from 4KB up to 2MB, on the overheads of the techniques. Our algorithmic methods outperform the state-of-the-art on average up to 512KB page sizes.…”
Section: Introductionmentioning
confidence: 65%
“…This manuscript is the journal extension of a previously published conference paper [25]. This work has been par- .…”
Section: Acknowledgmentsmentioning
confidence: 98%
“…It computes the solution by building a basis of orthogonal vectors each iteration. We use a sparse matrix version with the task decomposition described by Jaulmes et al [19]. The manual scheduling assigns tasks to sockets in a round-robin fashion.…”
Section: Tested Applicationsmentioning
confidence: 99%
“…The inputs are selected to balance between simulation time and LLC footprint. Finally, we use benchmark CG, a conjugate gradient method [23], implemented in OmpSs by Jaulmes et al [10]. The input is the matrix qa8fm from The University of Florida Sparse Matrix Collection [8].…”
Section: Benchmarksmentioning
confidence: 99%