How to parameterize models with bursty workloads

Casale, Giuliano; Mi, Ningfang; Cherkasova, Ludmila; Smirni, Evgenia

doi:10.1145/1453175.1453182

Cited by 25 publications

(24 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…[7] introduces the concept of characterizing burstiness in time series of workload arrivals using the index of dispersion. In [8], [9] the authors describe how the index of dispersion can be used to model and parametrize bursty workloads when investigating service times in multi-tier applications. [10] presents Fasttrack, a dynamic resource provisioning solution for multi-tiered applications that estimates the index of dispersion and utilizes it for determining when the workload is entering and exiting a bursty state.…”

Section: Related Workmentioning

confidence: 99%

Online Spike Detection in Cloud Workloads

Mehta

Dürango

Tordsson

et al. 2015

2015 IEEE International Conference on Cloud Engineering

View full text Add to dashboard Cite

Abstract-We investigate methods for detection of rapid workload increases (load spikes) for cloud workloads. Such rapid and unexpected workload spikes are a main cause for poor performance or even crashing applications as the allocated cloud resources become insufficient. To detect the spikes early is fundamental to perform corrective management actions, like allocating additional resources, before the spikes become large enough to cause problems. For this, we propose a number of methods for early spike detection, based on established techniques from adaptive signal processing. A comparative evaluation shows, for example, to what extent the different methods manage to detect the spikes, how early the detection is made, and how frequently they falsely report spikes.

show abstract

Section: Related Workmentioning

confidence: 99%

Online Spike Detection in Cloud Workloads

Mehta

Dürango

Tordsson

et al. 2015

2015 IEEE International Conference on Cloud Engineering

View full text Add to dashboard Cite

show abstract

“…Casale et al [7] showed how to incorporate burstiness into the analytical queuing network models. These approaches provide sound foundational basis for medium term or offline capacity estimation, in contrast to our adaptive approach based on measured performance.…”

Section: Related Workmentioning

confidence: 99%

Defragmenting the cloud using demand-based resource allocation

Shanmuganathan

Gulati

Varman

2013

Proceedings of the ACM SIGMETRICS/international Conference on Measurement and Modeling of Computer Systems

View full text Add to dashboard Cite

Public clouds sell capacity in the form of pre-defined virtual machine (VM) configurations to their tenants. This forces tenants to buy the VM configuration based on the peak usage. This diminishes the value proposition of moving to a public cloud as compared to doing consolidation in a private virtualized datacenter. Ideally we would like the cloud tenants to buy capacity in bulk and benefit from statistical multiplexing among workloads. This requires dynamic allocation of bulk capacity among VMs of a tenant that may be running on different servers across different datacenters.In this paper, we propose two novel algorithms called BPX and IDD that are able to provide the abstraction of buying bulk capacity to a cloud customer. These algorithms dynamically allocate the overall capacity between VMs based on their demand and user-set importance. Both algorithms are highly scalable and are designed to work in a large scale environment. Our analysis shows that BPX is able to meet all the desirable properties in providing the abstraction. We implemented the prototype of BPX as part of VMware's management software and showed that BPX is able to closely mimic the behavior of a centralized allocator, in a distributed manner. IntroductionConsider an IT department of a small company that is considering moving its workloads to a public cloud. Currently the company is running a private cloud, where they are able to take advantage of server consolidation by using in-house virtualization and cloud management software. Several companies offer solutions in this space, such as Nebula [16] Within the private cloud the VMs run in a controlled physical infrastructure maintained by the IT department of the same company. The private cloud is able to exploit temporal variations in the VM loads to reduce the amount of provisioned resources, by over-committing server CPU and memory. Hypervisors such as VMware ESX Server provide several techniques (like transparent page-sharing, ballooning, compression, and swap-to-SSD) to facilitate high consolidation ratios. The gains from statistical multiplexing benefit the bottom line of the company by reducing both its capital and operating expenses. In a public cloud the physical infrastructure is distributed over one or more mega data centers, supporting thousands of servers and hosting VMs belonging to multiple paying customers. In this situation the benefits of workload multiplexing accrue to the cloud service provider and not directly to the tenant, as the former increases consolidation ratios without regard to specific customers.In the public cloud, the tenant's is forced to purchase VMs based on their configured sizes. Typically VMs are configured for peak usage and the consolidation helps because VMs can use resource from each other based on their run-time demands on the host. An interesting study [12] on buying capacity in terms of fixed T-shirt sizes vs. doing time sharing showed that buying per VM capacity can be twice as expensive as compared to having a time sharing system. Lets...

show abstract

“…Temporal burstiness is the tendency of job arrivals to occur in clusters, or bursts, separated by long periods of relatively few or no arrivals [10], [19]. In fact, there always exist bursts in real workloads due to the occurrence of bags-of-tasks and idle periods during nights, weekends, holidays, etc.…”

Section: B Temporal and Spatial Burstinessmentioning

confidence: 99%

A Realistic Integrated Model of Parallel System Workloads

Minh

Wolters

Epema

2010

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing

View full text Add to dashboard Cite

Abstract-Performance evaluation is a significant step in the study of scheduling algorithms in large-scale parallel systems ranging from supercomputers to clusters and grids. One of the key factors that have a strong effect on the evaluation results is the workloads (or traces) used in experiments. In practice, several researchers use unrealistic synthetic workloads in their scheduling evaluations because they lack models that can help generate realistic synthetic workloads. In this paper we propose a full model to capture the following characteristics of real parallel system workloads: 1) long range dependence in the job arrival process, 2) temporal and spatial burstiness, 3) bag-oftasks behaviour, and 4) correlation between the runtime and the number of processors. Validation of our model with real traces shows that our model not only captures the above characteristics but also fits the marginal distributions well. In addition, we also present an approach to quantify burstiness in a job arrival process (temporal) as well as burstiness in the load of a trace (spatial).

show abstract

How to parameterize models with bursty workloads

Cited by 25 publications

References 17 publications

Online Spike Detection in Cloud Workloads

Online Spike Detection in Cloud Workloads

Defragmenting the cloud using demand-based resource allocation

A Realistic Integrated Model of Parallel System Workloads

Contact Info

Product

Resources

About