2015
DOI: 10.2172/1184174
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Soft Errors in Stencil based Computations

Abstract: Given the growing emphasis on system resilience, it is important to develop software-level error detectors that help trap hardware-level faults with reasonable accuracy while minimizing false alarms as well as the performance overhead introduced. We present a technique that approaches this idea by taking stencil computations as our target, and synthesizing detectors based on machine learning. In particular, we employ linear regression to generate computationally inexpensive models which form the basis for erro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
6
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 37 publications
0
6
0
Order By: Relevance
“…Our work focuses on soft error detection for stencil programs. Application-independent approaches for iterative programs rely on observing the evolution of a value over time to detect anomalies (e.g., AID [17] uses curve fitting, SSD [46] uses support vector machines (SVM) regression, and Reference [42] uses a machine learning-based approach to build regression models for synthesizing low cost detectors). Gomez and Capello exploits multivariate interpolation to detect and correct corruption in stencil application [25].…”
Section: Additional Related Workmentioning
confidence: 99%
“…Our work focuses on soft error detection for stencil programs. Application-independent approaches for iterative programs rely on observing the evolution of a value over time to detect anomalies (e.g., AID [17] uses curve fitting, SSD [46] uses support vector machines (SVM) regression, and Reference [42] uses a machine learning-based approach to build regression models for synthesizing low cost detectors). Gomez and Capello exploits multivariate interpolation to detect and correct corruption in stencil application [25].…”
Section: Additional Related Workmentioning
confidence: 99%
“…This technique assumes immediate error detection. Sharma et al [32] proposed an error detection method for stencil-based applications using the predicted values by a regression model. Dubey et al [18] explored local recovery schemes for applications using structured adaptive mesh refinement (AMR).…”
Section: Related Workmentioning
confidence: 99%
“…They convert the problem of detecting SDC into next-step prediction problem. Sharma et al [26] utilize temporal features of data (in addition to spatial features) and provide a tailored SDC detector for stencil applications where they use support vector machines as a linear function approximator. The main drawbacks of temporal data analytics are the memory overhead and the computation cost of maintaining snapshot data.…”
Section: Related Workmentioning
confidence: 99%