2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) 2016
DOI: 10.1109/ccgrid.2016.11
|View full text |Cite
|
Sign up to set email alerts
|

DieHard: Reliable Scheduling to Survive Correlated Failures in Cloud Data Centers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 35 publications
(18 citation statements)
references
References 10 publications
0
18
0
Order By: Relevance
“…Beyond energy consumption, a further objective in some works is to minimize the number of overloaded PMs because of the performance degradation that results from overloads . Some works also considered the cost of migration of VMs, or reliability Service level agreement (SLA) handling.…”
Section: Related Workmentioning
confidence: 99%
“…Beyond energy consumption, a further objective in some works is to minimize the number of overloaded PMs because of the performance degradation that results from overloads . Some works also considered the cost of migration of VMs, or reliability Service level agreement (SLA) handling.…”
Section: Related Workmentioning
confidence: 99%
“…Second, the use of alternative actuators (e.g., Running Average Power Limit (RAPL) that allows control the power consumption of CPU sockets and DRAM [46]) is planned to be evaluated against already established techniques for energy efficiency in data center server environments. Third, the power-performance tradeoffs of innovative approaches increasing the performance predictability [47,48] or reliability [49,50] of cloud computing systems should be assessed to understand at what cost the proposed improvements come. Finally, we plan to investigate novel power and performance aware load-balancing algorithms.…”
Section: Discussionmentioning
confidence: 99%
“…[88] provide a theoretical framework for the allocation of batch and service jobs in a set of constrained resources where some resources can be attacked or fail. Achieving a target reliability level can simply be a matter of placing extra replicas in different failure domains [139]. Defining failure domains at the edge, especially with end-user devices, can be difficult and requires proper observability and late characterisation of the failure modes of the devices.…”
Section: Eventually Consistent/probabilistic Orchestrationmentioning
confidence: 99%