Adaptive and Hierarchical Runtime Manager for Energy-Aware Thermal Management of Embedded Systems

Das, Anup; Al-Hashimi, Bashir M.; Merrett, Geoff V.

doi:10.1145/2834120

Cited by 46 publications

(26 citation statements)

References 46 publications


“…Statically optimized resource and power management are not likely to achieve the best performance when the input characteristics are changing. As a result reinforcement learning has been used for DPM [22][23][24][25][26], DVFS [18-21,52], or combination of DPM, DVFS and mapping [28,53,54] in embedded, desktop and datacenter domains. A detailed classification of existing RL based approaches for power/energy management is given Table 2.…”

Section: Reinforcement Learning For Run-time Management (mentioning)

confidence: 99%

“…Several principles have been followed for shutting the cores down, for example, greedy approach where a core enters into sleep mode as soon as processing on the core is finished and timeout approach that enters the core into sleep mode after certain time of idleness if no request is received within that time. Out of mapping, DVFS and DPM, they have been applied individually and in combinations as well, e.g., mapping in [15,16] and both mapping and DVFS in [27,28].…”

Section: Introduction (mentioning)

confidence: 99%
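The greedy and timeout shutdown principles named in the statement above can be contrasted with a minimal sketch. It assumes a core whose idle periods are known as a list of gap lengths; the function names, the gap values, and the idea of counting sleep transitions are illustrative assumptions, not taken from the cited papers.

```python
def greedy_policy(idle_gaps):
    """Greedy DPM: the core sleeps as soon as it becomes idle.

    Returns (total time asleep, number of sleep transitions).
    """
    asleep = sum(idle_gaps)
    transitions = sum(1 for gap in idle_gaps if gap > 0)
    return asleep, transitions


def timeout_policy(idle_gaps, timeout):
    """Timeout DPM: the core sleeps only after `timeout` units of idleness.

    Gaps no longer than the timeout never trigger a sleep transition.
    Returns (total time asleep, number of sleep transitions).
    """
    asleep = sum(max(0, gap - timeout) for gap in idle_gaps)
    transitions = sum(1 for gap in idle_gaps if gap > timeout)
    return asleep, transitions


# Hypothetical idle gaps between requests, in arbitrary time units.
gaps = [1, 5, 2, 10]
print(greedy_policy(gaps))      # sleeps through every gap
print(timeout_policy(gaps, 3))  # skips the two short gaps
```

With these toy numbers the greedy policy sleeps longer but pays for twice as many sleep transitions, which is the trade-off a timeout is meant to control.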


Learning-Based Run-Time Power and Energy Management of Multi/Many-Core Systems: Current and Future Trends

Singh, Leech, Basireddy et al. 2017

Journal of Low Power Electronics

Self Cite


Abstract: Multi/many-core systems are prevalent in several application domains targeting different scales of computing, such as embedded and cloud computing. These systems are able to fulfil ever-increasing performance requirements by exploiting their parallel processing capabilities. However, effective power/energy management is required during system operation for several reasons: to increase the operational time of battery-operated systems, to reduce the energy cost of datacenters, and to improve thermal efficiency and reliability. This article provides an extensive survey of learning-based run-time power/energy management approaches. The survey includes a taxonomy of the learning-based approaches. These approaches perform design-time and/or run-time power/energy management by employing learning principles such as reinforcement learning. The survey also highlights the trends followed by learning-based run-time power management approaches, their upcoming trends, and open research challenges.


“…For instance, reducing the supply voltage V may cause an increase in soft error rate (SER) [6]. Conversely, increasing V causes an increase of temperature, accelerates aging and the probability of breakdowns [7]. This leads to V_min ≤ V ≤ V_max (1). An execution may require more than a certain level of throughput to be meaningful [8].…”

Section: Introduction (mentioning)

confidence: 99%

Voltage, Throughput, Power, Reliability, and Multicore Scaling

et al. 2017


Abstract: Parallelization has been used to maintain a reasonable balance between energy consumption and performance in computing platforms, especially in modern multi- and many-core systems. This paper studies the interplay between performance and energy, and their relationships with parallelization scaling in the context of the reliable operating region, focusing on the effectiveness of parallelization scaling in throughput-power tradeoffs. Theoretical and experimental explorations show that a meaningful cross-platform analysis of this interplay can be achieved using the proposed method of binormalization of the ROR. The concept of this interplay is captured in an online tool for finding optimal operating points.

Introduction: In digital CMOS circuits, a higher supply voltage usually permits a higher operating (clock) frequency for capacitive load-balancing, and hence a higher throughput, given the same hardware platform. The scheme of dynamic voltage and frequency scaling (DVFS) scales the supply voltage and clock frequency together in order to obtain the best throughput under a given power budget or to save power for a given throughput requirement [1]. It is possible to increase system throughput for a given power limit, or to reduce power whilst maintaining throughput, by combining DVFS with parallelization or scaling to multiple computation units if the computation can be parallelized [2]. A major challenge for the precise analysis of the effectiveness of using parallelization for these goals is to determine the parallelizability of any particular execution, which is related to complex issues such as software and hardware architecture details and must be modelled on a per-execution basis [3]. Another challenge is that quantitative studies of power and/or throughput improvements for any DVFS decision need complicated execution-dependent models [4].

This paper explores the interplay between DVFS and parallelization scalability with respect to performance and power. The interplay is captured using the concept of a reliable operating region (ROR), which can be established from the knowledge of system reliability through experiments or simulations. The ROR therefore provides containment for platform and application specifics, hence helping to make the further analysis steps generic. The focus of this paper is the effectiveness of parallelization scaling. The ROR-based method can explore across the entire voltage range of a platform, from sub-threshold to super-threshold regions. The explorations and models presented in this paper confirm and explain the general view that combined DVFS and parallelization scaling produces the best advantage when the supply voltage is scaled down to near-threshold voltages. This is known as near-threshold


“…The framework is based upon the Reinforcement Learning (RL) approach described in [11], [12]. The framework invokes workload prediction and appropriate V-F control to achieve energy minimisation for applications executed on a multi-core hardware platform.…”

Section: Introduction (mentioning)

confidence: 99%

Machine learning for run-time energy optimisation in many-core systems

Biswas, Balagopal, Shafik et al. 2017

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017

Self Cite


Abstract: In recent years, the focus of computing has moved away from performance-centric serial computation to energy-efficient parallel computation. This necessitates run-time optimisation techniques to address the dynamic resource requirements of different applications on many-core architectures. In this paper, we report on intelligent run-time algorithms which have been experimentally validated for managing energy and application performance in many-core embedded systems. The algorithms are underpinned by a cross-layer system approach where the hardware, system software and application layers work together to optimise the energy-performance trade-off. Algorithm development is motivated by the biological process of how a human brain (acting as an agent) interacts with the external environment (system), changing their respective states over time. This leads to a pay-off for the action taken, and the agent eventually learns to take the optimal/best decisions in future. In particular, our online approach uses a model-free reinforcement learning algorithm that suitably selects the appropriate voltage-frequency scaling based on workload prediction to meet the applications' performance requirements and achieve energy savings of up to 16% in comparison to state-of-the-art techniques, when tested on four ARM A15 cores of an ODROID-XU3 platform.
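The agent, state, action and pay-off loop described in this abstract can be illustrated with a generic tabular Q-learning sketch. This is not the authors' algorithm: the frequency levels, workload levels, reward shape, and hyperparameters below are all hypothetical, and the "predicted workload" is simply drawn at random.

```python
import random

# Hypothetical V-F operating points and predicted workload demands (GHz).
FREQS_GHZ = [0.6, 1.0, 1.4]   # actions
DEMAND_GHZ = [0.5, 1.3]       # states: light and heavy predicted workload


def reward(state, action):
    """Pay-off: large penalty for missing the demand, else an energy proxy."""
    freq = FREQS_GHZ[action]
    if freq < DEMAND_GHZ[state]:
        return -10.0           # performance requirement violated
    return -(freq ** 2)        # assumed proxy: energy grows with frequency


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection over the Q-table row for this state."""
    if rng.random() < epsilon:
        return rng.randrange(len(q[state]))
    return max(range(len(q[state])), key=lambda a: q[state][a])


def train(steps=5000, alpha=0.1, gamma=0.5, epsilon=0.1, seed=0):
    """Model-free Q-learning; returns the greedy action index per state."""
    rng = random.Random(seed)
    q = [[0.0] * len(FREQS_GHZ) for _ in DEMAND_GHZ]
    state = 0
    for _ in range(steps):
        action = choose_action(q, state, epsilon, rng)
        next_state = rng.randrange(len(DEMAND_GHZ))  # stand-in for a predictor
        target = reward(state, action) + gamma * max(q[next_state])
        q[state][action] += alpha * (target - q[state][action])
        state = next_state
    return [max(range(len(row)), key=row.__getitem__) for row in q]


policy = train()
print([FREQS_GHZ[a] for a in policy])
```

With these toy numbers the learned policy is expected to choose the lowest frequency for the light workload and the highest for the heavy one, since only those choices meet demand at the least energy proxy.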
