2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2016
DOI: 10.1109/ipdps.2016.100
|View full text |Cite
|
Sign up to set email alerts
|

Reducing Waste in Extreme Scale Systems through Introspective Analysis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 26 publications
(31 citation statements)
references
References 17 publications
0
31
0
Order By: Relevance
“…While we demonstrated previously that such scheme could result in satisfying results [7], one could imagine more complex identifiers using real-time information on the system to adapt its detection.…”
Section: B Identifying Regime Changesmentioning
confidence: 90%
See 3 more Smart Citations
“…While we demonstrated previously that such scheme could result in satisfying results [7], one could imagine more complex identifiers using real-time information on the system to adapt its detection.…”
Section: B Identifying Regime Changesmentioning
confidence: 90%
“…In a previous work [7], we show how to leverage failure correlation to reduce wasted time. Although that work included a preliminary analysis of the different types of event and a simple monitoring tool, the paper focuses more on giving a picture of the whole structure and not on monitoring.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…We base our analysis on publicly available failure logs from LANL [15,14] and Tsubame [23]. We show that a previously proposed approach based on degraded intervals [3] leads to incorrect results, and we propose a new algorithm to detect failure cascades, based on the study of pairs of consecutive IATs. This new algorithm is used for the largest six public logs at our disposal, and we detect cascades in one log for sure, and possibly in a second one.…”
Section: Introductionmentioning
confidence: 99%