Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing 2015
DOI: 10.1145/2749246.2749262
|View full text |Cite
|
Sign up to set email alerts
|

Practical Resource Management in Power-Constrained, High Performance Computing

Abstract: Power management is one of the key research challenges on the path to exascale. Supercomputers today are designed to be worst-case power provisioned, leading to two main problems-limited application performance and under-utilization of procured power. In this paper, we propose RMAP, a practical, low-overhead resource manager targeted at future power-constrained clusters. The goals for RMAP are to improve application performance as well as system power utilization, and thus minimize the average turnaround time … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
47
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 74 publications
(47 citation statements)
references
References 45 publications
0
47
0
Order By: Relevance
“…For these reasons we generate our own cluster workload combining single-and multi-node applications, allowing us to measure the performance and power profiles of the workload. A similar methodology is used in other power and manufacturing variability related studies [16,44], but in our approach we use a wider number and range of applications.…”
Section: Methodology Evaluation 41 Experimental Setupmentioning
confidence: 99%
See 2 more Smart Citations
“…For these reasons we generate our own cluster workload combining single-and multi-node applications, allowing us to measure the performance and power profiles of the workload. A similar methodology is used in other power and manufacturing variability related studies [16,44], but in our approach we use a wider number and range of applications.…”
Section: Methodology Evaluation 41 Experimental Setupmentioning
confidence: 99%
“…We extend the default behavior to not exceed the global power budget, by considering the worst case scenario, which is that each job can consume the maximum power budget allowed per socket. Additionally, we extend the scheduler to initiate backfilling for power as well [44]. Typically, if a job requests more sockets than currently available, the scheduler will try to schedule a different job without causing delays.…”
Section: Job Scheduling Policiesmentioning
confidence: 99%
See 1 more Smart Citation
“…Etinski et al [10,9] proposed the use of dynamic voltage and frequency scaling (DVFS) at the job scheduling-level to save energy and improve overall job performance. Patki et al [25] proposed power-aware backfilling to improve the throughput of the system. Ellsworth et al [8] presented a power scheduler that enforced a system-wide power bound by reallocating power across the cluster.…”
Section: Related Workmentioning
confidence: 99%
“…Most real HPC applications do not utilize the peak power allocated power-pernode, leading to inefcient use of both nodes and power. Thus, in average, applications utilize 70% or less of the provisioned power, which leads to an inevitable waste of not only power, but also performance and infrastructure, making clear that hardware solutions are not sufcient and improved software solutions are needed as well for power manage-ment [86,87]. Additionally, modern supercomputers consume an enormous amount of power, where a signicant fraction is dedicated to offer cooling capabilities considering the peak power provision of the whole infrastructure.…”
Section: Rate Monotonic Scheduling (Rms) and Earliest Deadline First mentioning
confidence: 99%