Power efficient job scheduling by predicting the impact of processor manufacturing variability

Chasapis, Dimitrios; Moretó, Miquel; Schulz, Martin; Rountree, Barry; Valero, Mateo; Casas, Marc

doi:10.1145/3330345.3330372

Cited by 22 publications

(12 citation statements)

References 46 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…There are a broad group of knowledge bases on the Internet, for example, Kaggle, 5 UCI Repository, 6 Quandl, 7 and MSCOCO. 8…”

Section: Knowledge Base Constructionmentioning

confidence: 99%

“…The scheduling problem, in a general view, comprises both a set of resources and a set of consumers [8]. Its focus is to find an appropriate policy to manage the use of resources by several consumers in order to optimize a particular performance metric chosen as a parameter.…”

Section: Exploring Scheduling and Load Balancing On Data Science Demandsmentioning

confidence: 99%

“…We define static scheduling considering the scheduling grain as a task [8]. If data such as information about the processors, the execution time of the tasks, the size of the data, the communication pattern, and the dependency relation among the tasks are known in advance, we can affirm that we have a static or deterministic scheduling model.…”

Section: Exploring Scheduling and Load Balancing On Data Science Demandsmentioning

confidence: 99%

“…https://www.kaggle.com6 https://archive.ics.uci.edu/ml/index.php7 https://www.quandl.com/8 http://cocodataset.org/home7Looking at Data Science through the Lens of Scheduling and Load Balancing DOI: http://dx.doi.org/10.5772/intechopen.92578 …”

mentioning

confidence: 99%

See 3 more Smart Citations

Looking at Data Science through the Lens of Scheduling and Load Balancing

Silveira¹,

Reis²,

Bavaresco³

et al. 2020

Scheduling Problems - New Applications and Trends

View full text Add to dashboard Cite

The growth in data generated by private and public organizations leads to several opportunities to obtain valuable knowledge. In this scenario, data science becomes pertinent to define a structured methodology to extract valuable knowledge from raw data. It encompasses a heterogeneous group of techniques that challenge the implementation of a single platform capable of incorporating all the available resources. Thus, it is necessary to formulate a data science workflow based on different tools to extract knowledge from massive datasets. In this context, highperformance computing (HPC) provides the infrastructure required to optimize the processing time of data science workflows, which become a collection of tasks that must be efficiently scheduled to provide results in acceptable time intervals. While few studies explore the use of HPC for data science tasks, in the best of our knowledge, none conducts an in-depth analysis of scheduling and load balancing on such workflows. In this context, this chapter proposes an analysis of scheduling and load balancing from the perspective of data science scenarios. It presents concepts, environments, and tools to summarize the theoretical background required to define, assign, and execute data science workflows. Furthermore, we are also presenting new trends concerning the intersection of data science, scheduling, and load balance.

show abstract

“…There are a broad group of knowledge bases on the Internet, for example, Kaggle, 5 UCI Repository, 6 Quandl, 7 and MSCOCO. 8…”

Section: Knowledge Base Constructionmentioning

confidence: 99%

Section: Exploring Scheduling and Load Balancing On Data Science Demandsmentioning

confidence: 99%

Section: Exploring Scheduling and Load Balancing On Data Science Demandsmentioning

confidence: 99%

mentioning

confidence: 99%

See 2 more Smart Citations

Looking at Data Science through the Lens of Scheduling and Load Balancing

Silveira¹,

Reis²,

Bavaresco³

et al. 2020

Scheduling Problems - New Applications and Trends

View full text Add to dashboard Cite

show abstract

“…Intel Turbo Boost technology, AMD SenseMI technology, etc. [6,7]. However, with the rapid growth of mobile applications and demands on energy efficiency [8,9], such vendor strategies no longer work well for mobile devices:…”

Section: Introductionmentioning

confidence: 99%

An Efficient and Flexible Learning Framework for Dynamic Power and Thermal Co-Management

Cao

Shen

Zhang

et al. 2020

Proceedings of the 2020 ACM/IEEE Workshop on Machine Learning for CAD

View full text Add to dashboard Cite

At the era of Artificial Intelligence and Internet of Things (AIoT), battery-powered mobile devices are required to perform more sophisticated tasks featured with fast varying workloads and constrained power supply, demanding more efficient run-time power management. In this paper, we propose a deep reinforcement learning framework for dynamic power and thermal co-management. We build several machine learning models that incorporate the physical details for an ARM Cortex-A72, with on average 3% and 1% error for power and temperature predictions, respectively. We then build an efficient deep reinforcement learning control incorporating the machine learning models and facilitating the run-time dynamic voltage and frequency scaling (DVFS) strategy selection based on the predicted power, workloads and temperature. We evaluate our proposed framework, and compare the performance with existing management methods. The results suggest that our proposed framework can achieve 6.8% performance improvement compared with other alternatives. CCS CONCEPTS • Computing methodologies → Modeling and simulation; • Hardware → Chip-level power issues; • Computer systems organization → Embedded systems.

show abstract

AOA: Adaptive Overclocking Algorithm on CPU-GPU Heterogeneous Platforms

Chen

Sun

et al. 2023

Algorithms and Architectures for Parallel Processing

View full text Add to dashboard Cite

Although GPUs have been used to accelerate various convolutional neural network algorithms with good performance, the demand for performance improvement is still continuously increasing. CPU/GPU overclocking technology brings opportunities for further performance improvement in CPU-GPU heterogeneous platforms. However, CPU/GPU overclocking inevitably increases the power of the CPU/GPU, which is not conducive to energy conservation, energy efficiency optimization, or even system stability. How to effectively constrain the total energy to remain roughly unchanged during the CPU/GPU overclocking is a key issue in designing adaptive overclocking algorithms. There are two key factors during solving this key issue. Firstly, the dynamic power upper bound must be set to reflect the real-time behavior characteristics of the program so that algorithm can better meet the total energy unchanging constraints; secondly, instead of independently overclocking at both CPU and GPU sides, coordinately overclocking on CPU-GPU must be considered to adapt to real-time load balance for higher performance improvement and better energy constraints. This paper proposes an Adaptive Overclocking Algorithm (AOA) on CPU-GPU heterogeneous platforms to achieve the goal of performance improvement while the total energy remains roughly unchanged. AOA uses the function $$F_k$$ F k to describe the variable power upper bound and introduces the load imbalance factor W to realize the CPU-GPU coordinated overclocking. Through the verification of several types convolutional neural network algorithms on two CPU-GPU heterogeneous platforms (Intel$$^\circledR $$ ® Xeon E5-2660 & NVIDIA$$^\circledR $$ ® Tesla K80; Intel$$^\circledR $$ ® Core™i9-10920X & NIVIDIA$$^\circledR $$ ® GeForce RTX 2080Ti), AOA achieves an average of 10.7% performance improvement and 4.4% energy savings. To verify the effectiveness of the AOA, we compare AOA with other methods including automatic boost, the highest overclocking and static optimal overclocking.

show abstract

Power efficient job scheduling by predicting the impact of processor manufacturing variability

Cited by 22 publications

References 46 publications

Looking at Data Science through the Lens of Scheduling and Load Balancing

Looking at Data Science through the Lens of Scheduling and Load Balancing

An Efficient and Flexible Learning Framework for Dynamic Power and Thermal Co-Management

AOA: Adaptive Overclocking Algorithm on CPU-GPU Heterogeneous Platforms

Contact Info

Product

Resources

About