2014
DOI: 10.1007/978-3-319-09581-3_9
|View full text |Cite
|
Sign up to set email alerts
|

An Architecture for Automatic Scaling of Replicated Services

Abstract: Replicated services that allow to scale dynamically can adapt to requests load. Choosing the right number of replicas is fundamental to avoid performance worsening when input spikes occur and to save resources when the load is low. Current mechanisms for automatic scaling are mostly based on fixed thresholds on CPU and memory usage, which are not sufficiently accurate and often entail late countermeasures. We propose Make Your Service Elastic (MYSE), an architecture for automatic scaling of generic replicated … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 15 publications
(13 citation statements)
references
References 30 publications
0
13
0
Order By: Relevance
“…Service time is the time a server spends on processing the request, which is widely used in the queuing models to approximate the average response time or sojourn time. Except for a few works [Gergin et al 2014;Han et al 2014] that assume this metric as known a priori, to accurately measure it, either offline profiling [Prodan and Nae 2009] or support from the application [Aniello et al 2014] is required. Therefore, instead of directly probing it, some works use other approaches to approximate it.…”
Section: High-level Metricsmentioning
confidence: 99%
See 1 more Smart Citation
“…Service time is the time a server spends on processing the request, which is widely used in the queuing models to approximate the average response time or sojourn time. Except for a few works [Gergin et al 2014;Han et al 2014] that assume this metric as known a priori, to accurately measure it, either offline profiling [Prodan and Nae 2009] or support from the application [Aniello et al 2014] is required. Therefore, instead of directly probing it, some works use other approaches to approximate it.…”
Section: High-level Metricsmentioning
confidence: 99%
“…[Kamra et al 2004], [Villela et al 2007], [Sharma et al 2012], [Gandhi et al 2014a;Gandhi et al 2014b], and [Gergin et al 2014] employed this method. Some described the cluster using a queue with multiple servers, like [Ali-Eldin et al 2012b], [Jiang et al 2013], [Aniello et al 2014], and [Han et al 2014]. Other works modeled each server as a separate queue, such as [Doyle et al 2003], [Urgaonkar et al 2008], [Roy et al 2011], [Ghanbari et al 2012], [Kaur and Chana 2014], [Spinner et al 2014], and [Jiang et al 2010].…”
Section: Analytical Modelingmentioning
confidence: 99%
“…This article describes a cloud elasticity initiative called AutoElastic, which is evaluated using the proposed metrics. Thus, we organized this section in two groups: (i) performance and/or energy‐driven cloud computing middleware and models ; (ii) metrics to measure energy consumption and cost in cloud environments . These groups are addressed in Sections 2.1 and 2.2.…”
Section: Related Workmentioning
confidence: 99%
“…By combining feed‐forward control and feedback control, ElastMan addresses the challenges of the variable performance of cloud VMs. Aniello et al developed an architecture for automatically scaling replicated services. A queuing model of the replicated service is used to compute the expected response time, given the current configuration (number of replicas) and the distributions of both input requests and service times.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation