Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing 2013
DOI: 10.1145/2493123.2462920
|View full text |Cite
|
Sign up to set email alerts
|

Correcting soft errors online in LU factorization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
55
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
5
4
1

Relationship

2
8

Authors

Journals

citations
Cited by 37 publications
(55 citation statements)
references
References 19 publications
0
55
0
Order By: Relevance
“…This has been accomplished for matrix-vector multiplications by adding a checksum row in a matrix [22], but also for other operations such as QR and LU factorizations [13], [19]. This approach adds little memory space overhead at the price of computational overhead.…”
Section: Checkpointless Algebraic Recoveriesmentioning
confidence: 99%
“…This has been accomplished for matrix-vector multiplications by adding a checksum row in a matrix [22], but also for other operations such as QR and LU factorizations [13], [19]. This approach adds little memory space overhead at the price of computational overhead.…”
Section: Checkpointless Algebraic Recoveriesmentioning
confidence: 99%
“…These provide the necessary building blocks to address complete applications; for example, a version of the HPLinpack benchmark that handles fail-stop faults with low overhead has been demonstrated [31]. Recent work has extended to the handling of soft faults or the fail-continue case for dense matrix operations [30].…”
Section: Algorithmic Approachesmentioning
confidence: 99%
“…Algorithm-Based Fault Tolerance: ABFT has been actively researched for a handful of popular algorithms, including general matrix multiplication [39], iterative methods for solving linear equations [8], factorization (Cholesky [38], LU [9,10,14] and QR [14]). However, previous work focuses on the algorithms themselves and never uses a holistic view of the entire resilience ecosystem.…”
Section: Related Workmentioning
confidence: 99%