2014
DOI: 10.1016/j.future.2013.07.010
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting performance counters to predict and improve energy performance of HPC systems

Abstract: International audienceHardware monitoring through performance counters is available on almost all modern processors. Although these counters are originally designed for performance tuning, they have also been used for evaluating power consumption. We propose two approaches for modelling and understanding the behaviour of high performance computing (HPC) systems relying on hardware monitoring counters. We evaluate the effectiveness of our system modelling approach considering both optimizing the energy usage of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2014
2014
2022
2022

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 35 publications
(11 citation statements)
references
References 19 publications
0
11
0
Order By: Relevance
“…In terms of system components that can be controlled in terms of power and energy, the literature distinguishes frequency, core and uncore [45], disk [53], and network [53]. e latter can also be done through Energy-Efficient Ethernet (EEE) [78] that can turn physical layer devices into a low-power mode with savings up to even 70%-work [78] shows that the overhead of the technology is negligible for many practical scenarios.…”
Section: Classification Of Energy-awarementioning
confidence: 99%
See 3 more Smart Citations
“…In terms of system components that can be controlled in terms of power and energy, the literature distinguishes frequency, core and uncore [45], disk [53], and network [53]. e latter can also be done through Energy-Efficient Ethernet (EEE) [78] that can turn physical layer devices into a low-power mode with savings up to even 70%-work [78] shows that the overhead of the technology is negligible for many practical scenarios.…”
Section: Classification Of Energy-awarementioning
confidence: 99%
“…(3) Cluster [47] Proposes integration of power limitation into a job scheduler and implementation in SLURM [48] Proposes the enhanced power adaptive scheduling (E-PAS) algorithm with integration of power-aware approach into SLURM for limiting power consumption [49] Approach applicable to MPI applications but focusing on states of processes running on CPUs, i.e., reducing power consumption of CPUs on which processes are idle or perform I/O operations [50] Proposes DVFS-aware profiling that uses design time profiling and nonprofiling approach that performs computations at runtime [51] Split compilation is used with offline and online phases, results from the offline-phase passed to runtime optimization, grey box approach to autotuning, and assumes code annotations [52] Proposes a runtime library that performs poweraware optimization at runtime and searches for good configurations with DFS/DCT for application regions [53] Approaches for modeling, monitoring, and tracking HPC systems using performance counters and optimization of energy used in a cluster environment with consideration of CPU, memory, disk, and network [54] Proposed an energy-saving framework with ranking and correlating counters important for improving energy efficiency [55,56] Energy savings on a cluster with Sandy Bridge processors [57] With consideration of disk and network scaling [58] Including disk, memory, processor, or even fans [24] Analysis of performance vs power of a 32-node cluster running a NAS parallel benchmark [59] A procedure for a single device (a compute node with CPU); however, it is dedicated using such devices coupled into a cluster (tested on 8-9 nodes) [60] Homogeneous multicore cluster [61] Cluster [62] Computer system with several nodes each with multicore CPUs [63,64] Cluster with several nodes each with multicore CPUs [65] Cluster with several nodes with CPUs [66,67] Cluster in a data center [68] Sandy Bridge cluster [69] Cluster with InfiniBand [70] Overprovisioned cluster which can run a certain number of nodes at peak power and more at lower power caps [71] Cluster with 1056 Dell PowerEdge SC1425 nodes …”
Section: Optimization Level Work Descriptionmentioning
confidence: 99%
See 2 more Smart Citations
“…A study directed towards cross platform energy usage estimation of individual applications is found in [6]. The authors suggest a model capable of predicting the energy consumption of a given application during the application's execution phase.…”
Section: Related Workmentioning
confidence: 99%