2017
DOI: 10.1145/3084442
|View full text |Cite
|
Sign up to set email alerts
|

Portfolio-driven Resource Management for Transient Cloud Servers

Abstract: Cloud providers have begun to offer their surplus capacity in the form of low-cost transient servers, which can be revoked unilaterally at any time. While the low cost of transient servers makes them attractive for a wide range of applications, such as data processing and scientific computing, failures due to server revocation can severely degrade application performance. Since different transient server types offer different cost and availability tradeoffs, we present the notion of server portfolios that is b… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
17
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
2
2

Relationship

1
9

Authors

Journals

citations
Cited by 26 publications
(17 citation statements)
references
References 40 publications
0
17
0
Order By: Relevance
“…Systems for handling transient server revocation use a combination of fault tolerance and resource allocation to mitigate the performance and cost effects of preemptions. Prior work has focused on system [43,45] and application [23,30,40,41,49,51] support for handling preemptions. We believe that deflatable VMs minimize the need for such middleware, and can avoid the performance, development, and deployment costs associated with preemption.…”
Section: Related Workmentioning
confidence: 99%
“…Systems for handling transient server revocation use a combination of fault tolerance and resource allocation to mitigate the performance and cost effects of preemptions. Prior work has focused on system [43,45] and application [23,30,40,41,49,51] support for handling preemptions. We believe that deflatable VMs minimize the need for such middleware, and can avoid the performance, development, and deployment costs associated with preemption.…”
Section: Related Workmentioning
confidence: 99%
“…•Public Cloud: There are several research works that optimize for the resource provisioning cost in the public cloud. These works broadly fall into two categories: (i) tuning the auto-scaling policy based on changing needs (e.g., Spot, On-Demand) [17,29,42,44,45,78,91], (ii) predicting peak loads and offering proactive provisioning based auto-scaling policy [42ś44, 63,79,97]. Fifer uses similar load prediction models and auto-scales containers but with respect to serverless function chains.…”
Section: Related Workmentioning
confidence: 99%
“…There are several research works that optimize for the resource provisioning cost for various tenant applications in the public cloud. These works broadly fall into two categories: (i) tuning the auto-scaling policy based on changing needs [7,9,19,24,25,34,39], (ii) prediction-based proactive provisioning auto-scaling policy [19,23,24,28,35]. Complementary to these approaches, we propose to user serverless functions along with VMs for cost-effective autoscaling.…”
Section: Related Workmentioning
confidence: 99%