Proceedings of the 2017 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems 2017
DOI: 10.1145/3078505.3078532
|View full text |Cite
|
Sign up to set email alerts
|

Optimal Service Elasticity in Large-Scale Distributed Systems

Abstract: A fundamental challenge in large-scale cloud networks and data centers is to achieve highly efficient server utilization and limit energy consumption, while providing excellent userperceived performance in the presence of uncertain and timevarying demand patterns. Auto-scaling provides a popular paradigm for automatically adjusting service capacity in response to demand while meeting performance targets, and queue-driven auto-scaling techniques have been widely investigated in the literature. In typical data c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
32
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 28 publications
(32 citation statements)
references
References 38 publications
0
32
0
Order By: Relevance
“…Broader aspects of large data center and cloud optimization have been pushed to the forefront ever since the appearance of public clouds, captivating both the research community and the industry. Challenges such as auto-scaling [24,27], job scheduling [31], energy consumption [37] to name a few are interleaved with budgeting issues [13] and other concerns. When it comes to service pricing, as mentioned previously, cloud providers do not offer at this time quality of service guarantees in terms of data delivery.…”
Section: Relaysmentioning
confidence: 99%
“…Broader aspects of large data center and cloud optimization have been pushed to the forefront ever since the appearance of public clouds, captivating both the research community and the industry. Challenges such as auto-scaling [24,27], job scheduling [31], energy consumption [37] to name a few are interleaved with budgeting issues [13] and other concerns. When it comes to service pricing, as mentioned previously, cloud providers do not offer at this time quality of service guarantees in terms of data delivery.…”
Section: Relaysmentioning
confidence: 99%
“…The process Q(t) is a timeinhomogenous density dependent population process; see e.g. [27,28,29] for recent queueing theoretic applications of such processes. We can study the fluid limit of the system, that is, we consider a sequence of models Q K (t), the Kth model having arrival rate λ f (t)K for the f th flow and departure rate K μn (x/K) from the nth neighbourhood when there are x vehicles in that neighbourhood.…”
Section: Fluid Limitmentioning
confidence: 99%
“…The PTC policy is more advanced but recently has been applied to the cluster management practice [27]; it can achieve a higher utilization of servers [24]. More discussion on dispatching policies can be found in [14,15,18,25]. Once the job j is dispatched to a server, the server will be occupied by the on-demand job owner during the period [a j , d j ], i.e., from the beginning of slot a j until the end of slot d j .…”
Section: Dispatching High Priority Of On-demand Jobsmentioning
confidence: 99%