2014 IEEE International Parallel &Amp; Distributed Processing Symposium Workshops 2014
DOI: 10.1109/ipdpsw.2014.91
|View full text |Cite
|
Sign up to set email alerts
|

Metrics for Evaluating Energy Saving Techniques for Resilient HPC Systems

Abstract: The metrics used for evaluating energy saving techniques for future HPC systems are critical to the correct assessment of proposed methods. Current predictions forecast that overcoming reduced system reliability, increased power requirements and energy consumption will be a major design challenge for future systems. Modern runtime energy-saving research efforts do not take into account the energy spent providing reliability. They also do not account for the increase in the probability of failure during applica… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…We demonstrated this technique both for single-node applications and for parallel multi-node MPI+OpenMP applications. We demonstrated reductions in power consumption of 7.4% in the Lulesh parallel MPI+OpenMP application with no increase in application execution time [51,29].…”
Section: Introductionmentioning
confidence: 91%
See 1 more Smart Citation
“…We demonstrated this technique both for single-node applications and for parallel multi-node MPI+OpenMP applications. We demonstrated reductions in power consumption of 7.4% in the Lulesh parallel MPI+OpenMP application with no increase in application execution time [51,29].…”
Section: Introductionmentioning
confidence: 91%
“…Therefore by decreasing the number of these events, even more energy is saved. Figure 2.4 shows the potential benefits based on production checkpoint and restart parameters [29]. Furthermore, improvements in power controls of future microprocessors will make changing power states easier and more effective.…”
Section: Conclusion and Impactmentioning
confidence: 99%
“…e study in [14] presents how to adapt performance measuring tools for energy efficiency management of parallel applications, specifically the libadapt library and an OpenMP wrapper. e study in [15] presents a survey of several energy savings methodologies with analysis concerning their effectiveness in an environment in which failures do occur. Energy costs of reliability are considered.…”
Section: Existing Surveysmentioning
confidence: 99%