On the use of hybrid reinforcement learning for autonomic resource allocation

Tesauro, Gerald; Jong, Nicholas K.; Das, Rajarshi; Bennani, Mohamed N.

doi:10.1007/s10586-007-0035-6

Cited by 97 publications

(70 citation statements)

References 26 publications


“…In [29], a model based on queuing theory is corrected at runtime by exploiting online reinforcement learning to determine the batching level delivering lowest latencies for a Total-Order based broadcast primitive. A similar approach is undertaken in [30], where the target is optimizing resource provisioning in a distributed application and the online learner is based on Q-learning. In [31], [32] analytical models are complemented at runtime by decision tree regressors, in the former case with the purpose of optimizing the global multiprogramming level for distributed transactional applications, in the latter to allow a continuous validation and correction of difference performance predictors in a data center.…”

Section: Reconfiguration Manager (mentioning)

confidence: 99%

Self-Tuning Transactional Data Grids: The Cloud-TM Approach

Didona

Romano

2014

2014 IEEE 3rd Symposium on Network Cloud Computing and Applications (Ncca 2014)


Abstract: In this paper we focus on the problem of self-tuning distributed transactional cloud data stores by presenting an overview of the autonomic mechanisms integrated in the Cloud-TM platform, a transactional cloud data store developed in the context of a recent European project. Cloud-TM takes a holistic approach to self-tuning and elastic scaling, treating them as strongly intertwined problems with the ultimate goals of i) achieving optimal efficiency at any scale of the platform, and ii) minimizing resource consumption in the presence of varying workloads. From a methodological perspective, this is achieved by relying on the innovative idea of exploiting the diversity of different modelling approaches, including analytical models, machine learning and simulations. By employing these modelling techniques in synergy, the Cloud-TM platform can dynamically optimize the underlying distributed data store over a number of dimensions, including its scale, the strategy it adopts to distribute and replicate data among the platform's nodes, as well as its replication protocol.


“…While resource allocation is more concerned with low level scheduling of tasks at the virtual machine level, the parallels between them still merit their inclusion. Tesauro investigated the use of a hybrid reinforcement learning technique for autonomic resource allocation [15]. He applied this research to optimizing server allocation in data centers.…”

Section: Background Research (mentioning)

confidence: 99%

A Learning Architecture for Scheduling Workflow Applications in the Cloud

Barrett

Howley

Duggan

2011

2011 IEEE Ninth European Conference on Web Services


Abstract: The scheduling of workflow applications involves the mapping of individual workflow tasks to computational resources, based on a range of functional and non-functional quality of service requirements. Workflow applications have extensive computational requirements, and often involve the processing of significant amounts of data. Furthermore, dependencies that exist amongst tasks require that schedules be generated strictly in accordance with defined precedence constraints. The emergence of cloud computing has introduced a utility-type market model, where computational resources of varying capacities can be procured on demand, in a pay-per-use fashion. In general the two most important objectives of workflow schedulers are the minimisation of both cost and makespan. As well as computational costs incurred from processing individual tasks, workflow schedulers must also plan for data transmission costs where potentially large amounts of data must be transferred between compute and storage sites. This paper proposes a novel cloud workflow scheduling approach which employs a Markov Decision Process to optimally guide the workflow execution process depending on environmental state. In addition the system employs a genetic algorithm to evolve workflow schedules. The overall architecture is presented, and initial results indicate the potential of this approach for developing viable workflow schedules on the Cloud.


“…To efficiently calculate those trajectories, we could again leverage machine learning techniques, in particular Reinforcement Learning (RL). RL has been successfully used for Autonomic Computing in the recent past to synthesize policies for decision-making, aiming once again at self-optimization [27] [28]. A key concept in RL is "reward", a scalar that valuates the observed consequence of a decision, and which the learning agent making decisions aims to maximize.…”

Section: Ongoing and Future Work (mentioning)

confidence: 99%

Synthesis of application-level utility functions for autonomic self-assessment

2010


We present a non-analytic approach to self-assessment for Autonomic Computing. Our approach leverages utility functions, at the level of an autonomic application, or even a single task or feature performed by that application. This paper describes the fundamental steps of our approach: instrumentation of the application; collection of exhaustive samples of runtime data about relevant quality attributes of the application, as well as characteristics of its runtime environment; synthesis of a utility function through statistical correlation over the collected data points; and embedding of code corresponding to the equation of the synthesized utility function within the application, which enables the computation of utility values at run time. We employ a number of case studies, with their results and implications, to motivate and discuss the significance of application-level utility, illustrate our statistical synthesis method, and describe our framework for instrumentation, monitoring, and utility function embedding/evaluation.
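The citation statement for this entry describes "reward" as the scalar a learning agent tries to maximize. As a rough illustration only, the sketch below runs tabular Q-learning on a made-up four-state chain. The environment, the names `chain_step` and `q_learning`, and all parameter values are assumptions chosen for the example; none of them come from the cited papers, and this is not the hybrid method of Tesauro et al.

```python
import random

GOAL = 3  # hypothetical terminal state of a 4-state chain (states 0..3)

def chain_step(state, action):
    """Toy environment: action 1 moves right, action 0 moves left (floor at 0)."""
    next_state = min(state + 1, GOAL) if action == 1 else max(state - 1, 0)
    done = next_state == GOAL
    reward = 1.0 if done else 0.0  # scalar reward, paid only on reaching the goal
    return next_state, reward, done

def q_learning(n_states, n_actions, step, episodes=500, max_steps=50,
               alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)  # explore
            else:
                best = max(q[state])               # exploit, breaking ties at random
                action = rng.choice(
                    [a for a in range(n_actions) if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Target: immediate reward plus discounted best value of the next state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q

q = q_learning(n_states=4, n_actions=2, step=chain_step)
# Greedy action in each non-terminal state after training.
policy = [max(range(2), key=lambda a: q[s][a]) for s in range(GOAL)]
print(policy)
```

The agent is never told that moving right is correct; it only observes the reward at the goal, and the discounted update propagates that value back to earlier states.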

