2010 International Conference on Dependable Systems and Networks Workshops (DSN-W) 2010
DOI: 10.1109/dsnw.2010.5542626
|View full text |Cite
|
Sign up to set email alerts
|

Accurate fault prediction of BlueGene/P RAS logs via geometric reduction

Abstract: This investigation presents two distinct and novel approaches for the prediction of system failures occurring inOak Ridge National Laboratory's Blue Gene/P supercomputer. Each technique uses raw numeric and textual subsets of large data logs of physical system information such as fan speeds and CPU temperatures. This data is used to develop models of the system capable of sensing anomalies, or deviations from nominal behavior. Each algorithm predicted event log reported anomalies in advance of their occurrence… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
1

Relationship

3
3

Authors

Journals

citations
Cited by 9 publications
(12 citation statements)
references
References 9 publications
0
12
0
Order By: Relevance
“…Our work is inspired by the Huan's work [29] and Thompson's work [30]. In [29], a KD-tree based sampling method was proposed for informative instance selection.…”
Section: E Results Summarymentioning
confidence: 99%
See 1 more Smart Citation
“…Our work is inspired by the Huan's work [29] and Thompson's work [30]. In [29], a KD-tree based sampling method was proposed for informative instance selection.…”
Section: E Results Summarymentioning
confidence: 99%
“…In this paper, we modify the KD-tree algorithm to rule out instances irrelevant to abnormal states. In [30], Thompson utilized two feature extraction methods on numeric data for failure prediction in the Blue Gene/P system, which was based on the same dataset in our first case study. However, their study was only on the feature level, which was insufficient for data reduction.…”
Section: E Results Summarymentioning
confidence: 99%
“…After initializing P (1) , the algorithm proceeds as follows: at each iteration i we calculate the 2 -norms of all secants under the current projection. In other words we find the column of P (i) T S with the smallest 2 norm; call this column index j * .…”
Section: The Secant-avoidance Projection Algorithmmentioning
confidence: 99%
“…Figure 2 shows the projection into R 3 obtained from PCA (i.e. the projection of the data from R 10 to R 3 via Pr (1) ), and Figure 3 shows the projection onto R 3 obtained from 100 iterations of the Secant-Avoidance Projection algorithm. In each case, Fig.…”
Section: Curvesmentioning
confidence: 99%
See 1 more Smart Citation