2013 IEEE International Conference on Cluster Computing (CLUSTER) 2013
DOI: 10.1109/cluster.2013.6702627
|View full text |Cite
|
Sign up to set email alerts
|

Thermal aware automated load balancing for HPC applications

Abstract: Abstract-As we move towards the exascale era, power and energy have become major challenges. Some of the supercomputers draw more than 10 megawatts, leading to high energy bills. A significant portion of this energy is spent in cooling. In this paper, we propose an adaptive control system that minimizes the cooling energy by using Dynamic Voltage and Frequency Scaling to control the temperature and performing load balancing. This framework, which is a part of the adaptive runtime system, monitors the system an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 19 publications
(9 citation statements)
references
References 11 publications
0
9
0
Order By: Relevance
“…To support tasks, we created a task queue [13] on each PE, which is distinct from the normal message queue. The messages in the message queue are meant for that specific PE, whereas the tasks in the task queue can be stolen by different cores on a node.…”
Section: Task Queuementioning
confidence: 99%
“…To support tasks, we created a task queue [13] on each PE, which is distinct from the normal message queue. The messages in the message queue are meant for that specific PE, whereas the tasks in the task queue can be stolen by different cores on a node.…”
Section: Task Queuementioning
confidence: 99%
“…This can introduce significant amount of load imbalance in a tightly coupled application. To mitigate the load imbalance, the RTS monitors the application characteristics and whenever it detects load imbalance, it triggers a call to the load balancer [10].…”
Section: Power Awarenessmentioning
confidence: 99%
“…Indeed, Distem is able to emulate DVFS at core level. A similar experiment using DVFS is presented in [13]. However, it could not be performed at the core level since DVFS was only available at the socket level.…”
Section: Related Workmentioning
confidence: 99%