Applying reinforcement learning towards automating resource allocation and application scalability in the cloud

Barrett, Enda; Howley, Enda; Duggan, James

doi:10.1002/cpe.2864

Cited by 184 publications

(112 citation statements)

References 23 publications


“…ii A cloud controller reasoning engine (RobusT2Scale [18]) implemented in Matlab. 2 iii A cloud-based application framework (ElasticBench) implemented with Microsoft .NET technologies (.NET framework 4 and Azure SDK 2.5). 3 iv The integration between these three components by software connectors (cf.…”

Section: Methods (mentioning)

confidence: 99%

Fuzzy Self-Learning Controllers for Elasticity Management in Dynamic Cloud Architectures

Jamshidi, Sharifloo, Pahl et al. 2016

2016 12th International ACM SIGSOFT Conference on Quality of Software Architectures (QoSA)


Abstract: Cloud controllers support the operation and quality management of dynamic cloud architectures by automatically scaling the compute resources to meet performance guarantees and minimize resource costs. Existing cloud controllers often resort to scaling strategies that are codified as a set of architecture adaptation rules. However, for a cloud provider, deployed application architectures are black boxes, making it difficult at design time to define optimal or pre-emptive adaptation rules. Thus, the burden of taking adaptation decisions is often delegated to the cloud application. We propose the dynamic learning of adaptation rules for deployed application architectures in the cloud. We introduce FQL4KE, a self-learning fuzzy controller that learns and modifies fuzzy rules at runtime. The benefit is that we do not have to rely solely on precise design-time knowledge, which may be difficult to acquire. FQL4KE empowers users to configure cloud controllers by simply adjusting weights representing priorities for architecture quality instead of defining complex rules. FQL4KE has been experimentally validated using the cloud application framework ElasticBench in Azure and OpenStack. The experimental results demonstrate that FQL4KE outperforms both a fuzzy controller without learning and the native Azure auto-scaling.


“…However, the prediction accuracy highly depends on the number observations and the interval [18]. Reinforcement learning (e.g., [42]) enable learning elasticity policies from observations. However, it requires long learning, which is only applicable for stable workloads.…”

Section: Related Work (mentioning)

confidence: 99%

Autonomic resource provisioning for cloud-based software

Jamshidi, Ahmad, Pahl 2014

Proceedings of the 9th International Symposium on Software Engineering for Adaptive and Self-Managing Systems

122

131


Cloud elasticity provides a software system with the ability to maintain optimal user experience by automatically acquiring and releasing resources, while paying only for what has been consumed. The mechanism for automatically adding or removing resources on the fly is referred to as auto-scaling. The state-of-the-practice with respect to auto-scaling involves specifying threshold-based rules to implement elasticity policies for cloud-based applications. However, there are several shortcomings regarding this approach. Firstly, the elasticity rules must be specified precisely by quantitative values, which requires deep knowledge and expertise. Furthermore, existing approaches do not explicitly deal with uncertainty in cloud-based software, where noise and unexpected events are common. This paper exploits fuzzy logic to enable qualitative specification of elasticity rules for cloud-based software. In addition, this paper discusses a control theoretical approach using type-2 fuzzy logic systems to reason about elasticity under uncertainties. We conduct several experiments to demonstrate that cloud-based software enhanced with such an elasticity controller can robustly handle unexpected spikes in the workload and provide acceptable user experience. This translates into increased profit for the cloud application owner.
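For readers unfamiliar with the threshold-based rules the abstract above refers to, the following is a minimal illustrative sketch, not taken from the cited paper. The class name `ThresholdRule`, the utilisation metric, and all numeric values are assumptions chosen for the example; they show why such rules need precise quantitative values up front.

```python
from dataclasses import dataclass


@dataclass
class ThresholdRule:
    """Hypothetical threshold-based elasticity rule (illustration only)."""

    scale_out_above: float  # assumed metric: average utilisation in [0, 1]
    scale_in_below: float
    min_instances: int = 1
    max_instances: int = 10

    def decide(self, utilisation: float, instances: int) -> int:
        """Return the instance count to run next, one step at a time."""
        if utilisation > self.scale_out_above:
            return min(instances + 1, self.max_instances)
        if utilisation < self.scale_in_below:
            return max(instances - 1, self.min_instances)
        return instances


# Both thresholds are hand-picked numbers, which is the expertise
# burden the abstract describes.
rule = ThresholdRule(scale_out_above=0.8, scale_in_below=0.3)
```

Calling `rule.decide(0.9, 2)` would request a third instance, while a utilisation between the two thresholds leaves the count unchanged.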

show abstract

“…Barrett et al [6] consider a Q-learning approach to the auto-scaling of cloud applications deployment. Seeking to address the dimensionality issues associated with Reinforcement Learning approaches by adopting a hybrid approach.…”

Section: Related Work (mentioning)

confidence: 99%
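As background for the citation statement above, here is a minimal sketch of a plain tabular Q-learning update applied to a scaling decision. It is not Barrett et al.'s hybrid method; the state encoding, the action set `ACTIONS`, the reward value, and the hyperparameters are all assumptions made for illustration.

```python
import random
from collections import defaultdict

# Hypothetical action set: remove one VM, hold, add one VM.
ACTIONS = (-1, 0, 1)


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy selection over the assumed action set."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard Q-learning update:
    Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
    return q[(state, action)]


# States are assumed to be (load level, VM count) pairs.
q = defaultdict(float)
new_value = q_update(
    q,
    state=("high_load", 2),
    action=1,
    reward=1.0,
    next_state=("medium_load", 3),
)
```

The table grows with every distinct (state, action) pair, which hints at the dimensionality issue the statement mentions.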

Using Machine Learning in Trace-driven Energy-Aware Simulations of High-Throughput Computing Systems

McGough, Moubayed, Forshaw 2017

Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering Companion


Abstract: When performing a trace-driven simulation of a High Throughput Computing system we are limited to the knowledge which should be available to the system at the current point within the simulation. However, the trace-log contains information we would not be privy to during the simulation. Through the use of Machine Learning we can extract the latent patterns within the trace-log, allowing us to accurately predict characteristics of tasks based only on the information we would know. These characteristics will allow us to make better decisions within simulations, allowing us to derive better policies for saving energy. We demonstrate that we can accurately predict (up to 99% accuracy), using oversampling and deep learning, those tasks which will complete, while at the same time providing accurate predictions for the task execution time and memory footprint using Random Forest Regression.
