Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach

Wang, Yiming; Zhang, Weizhe; Hao, Meng; Wang, Zheng

doi:10.1109/tpds.2021.3092270

Cited by 19 publications

(15 citation statements)

References 57 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Recent research efforts have focused on how to improve this architecture. Methods include: enabling more performance counters to build complex system features [19] [20] [27], using powerful models for prediction [19] [18] [20] [21] [24], designing better control rules [19] [18] [20] [21], or learning control policy based on reinforcement learning [24] [25] [26] [27] [30].…”

Section: Problem Statement and Proposed Approachmentioning

confidence: 99%

“…Based on the previous work, Ramegowda et al [42] implemented and validated the hybrid DVFS method in various embedded devices running the Linux system. Wang et al [27] used Double Q learning to explore the energy-performance optimization for both CPU core and uncore parts. Specifically, they used the instruction per cycle (IPC), and the misses per operation (MPO) [43] as the state measurement of the environment and used IP C 3 W as the reward to describe the tradeoff between energy and performance.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Real-time task scheduling and network device security for complex embedded systems based on deep learning networks

Zhou

2020

Microprocessors and Microsystems

View full text Add to dashboard Cite

Section: Problem Statement and Proposed Approachmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Real-time task scheduling and network device security for complex embedded systems based on deep learning networks

Zhou

2020

Microprocessors and Microsystems

View full text Add to dashboard Cite

“…RELATED WORK There are many studies which rely on power capping, however very few studies rely on dynamic power capping while fewer studies combine uncore frequency scaling to dynamic power capping. In [32] the authors propose to rely on reinforcement learning to get the best energy consumption with uncore frequency and power capping. Instruction Per Cycles (IPC) are used to control performance loss.…”

Section: H Conclusionmentioning

confidence: 99%

Combining Uncore Frequency and Dynamic Power Capping to Improve Power Savings

Guermouche

2022

2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

View full text Add to dashboard Cite

The US Department of Energy sets a limit of 20 to 30 MW for future exascale machines. In order to control their power consumption, modern processors provide many features. Power capping and uncore frequency scaling are examples of such features which allow to limit the power consumed by a processor.In this paper, we propose to combine dynamic power capping to uncore frequency scaling. We propose DUFP, an extension of DUF, an existing tool which dynamically adapts uncore frequency. DUFP dynamically adapts the processor power cap to the application needs. Finally, just like DUF, DUFP can tolerate performance loss up to a user-defined limit. With a controlled impact on performance, DUFP is able to provide power savings with no energy loss.The evaluation of DUFP shows that it manages to stay within the user-defined slowdown limits for most of the studied applications. Moreover, combining uncore frequency scaling to power capping: (i) improves power consumption by up to 13.98 % with additional energy savings for applications where uncore frequency scaling has a limited impact, (ii) improves power consumption by up to 7.90 % compared to using uncore frequency scaling by itself and (iii) leads to more than 5 % power savings at 5 % tolerated slowdown with no energy loss for most applications.

show abstract

“…READEX (Runtime Exploitation of Application Dynamism for Energy-efficient eXascale computing) OpenMP and MPI code instrumenting tools for optimization of energy-aware HPC computing 29 A multi-agent based intelligent energy management framework for a reduction of power of idle or partially loaded CPUs 20 LEO (learning for energy optimization) a framework based on a probabilistic graphical model for obtaining Pareto-optimal power and performance trade-offs 30 A framework implementing two EDP-optimizing (energy delay product) algorithms: SEA and SPRA 31 An extension to SLURM scheduler to implement a "uniform frequency" in different configuration modes 10 Power capping CoPPer framework using power capping and adaptive control to approximate non-linearities in the power and performance relationship 32 PShifter: dynamic redistribution of power budged between cluster nodes using power limitation for faster processes 33 Application optimizations A framework modeling impact of optimization and providing recommendations for energy savings 24 Preparing best application configuration and settings on a GPU 25 Controlling CPU frequency, disk spinning and network speed scaling 34 Hybrids of the above A software/hardware approach with power capping based on a framework that makes decisions on configurations going through nodes 1 A reinforcement learning framework using power capping and uncore frequency scaling for optimization of the power consumption and run time 35 Scheduling/software as well as resource management with the use of RAPL 11 Scheduling kernels within a GPU as well as frequency scaling 16 Subsequently, we provide a concise comparison of selected approaches presented in respective research works to energy-performance oriented optimization in high-performance computing and presentation of the contribution and differences presented by us within this article.…”

Section: Description Of the Toolmentioning

confidence: 99%

DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing

2022

View full text Add to dashboard Cite

In the article we propose an automatic power capping software tool DEPO that allows one to perform runtime optimization of performance and energy related metrics. For an assumed application model with an initialization phase followed by a running phase with uniform compute and memory intensity, the tool performs automatic tuning engaging one of the two exploration algorithms-linear search (LS) and golden section search (GSS), finds a power cap optimizing a given metric and sets it for the remaining computations. The considered metrics include energy (E), energy-delay sum, energy-delay product. We present experimental results obtained for a set of benchmarks that differ in compute and memory intensity-parallel custom built OpenMP implementations of: numerical integration, heat distribution simulation (HEAT), fast Fourier transform (FFT), and additionally NAS parallel benchmarks: CG, MG, BT, SP, and LU. Tests were performed using multi-core CPUs that are representatives of modern servers and the desktop family: 2 × Intel Xeon E5-2670 v3 CPU (Haswell-EP) and Intel i7-9700K CPU (Coffee Lake). The results show that our approach enabled considerable improvements for the tested metrics, for example, for HEAT and Coffee Lake we minimized energy by 50% at the cost of a 15% increase in execution time (LS), for FFT energy was minimized by 40% at a 25.5% increase in execution time (GSS), for SP and Haswell energy was minimized by 25% at the cost of an 18.5% time increase and for Coffee Lake energy was decreased by 56% with a 12% time increase. K E Y W O R D Sautomatic power capping, green computing, HPC, performance-energy trade-off, software tools INTRODUCTIONNowadays, providing high-performance computing (HPC) resources can be expensive, especially when the power required by computing centers exceeds megawatts. Under such circumstances, every method that allows users to decrease power consumption is extremely desirable, and even low energy savings are multiplied by the effects of scale. Thus, new

show abstract

Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach

Cited by 19 publications

References 57 publications

Real-time task scheduling and network device security for complex embedded systems based on deep learning networks

Real-time task scheduling and network device security for complex embedded systems based on deep learning networks

Combining Uncore Frequency and Dynamic Power Capping to Improve Power Savings

DEPO: A dynamic energy‐performance optimizer tool for automatic power capping for energy efficient high‐performance computing

Contact Info

Product

Resources

About