2007
DOI: 10.1007/s10586-007-0035-6
|View full text |Cite
|
Sign up to set email alerts
|

On the use of hybrid reinforcement learning for autonomic resource allocation

Abstract: Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of explicit system performance models. In principle, RL can automatically learn high-quality management policies without an explicit performance model or traffic model, and with little or no built-in system specific knowledge. In our original work (Das, R.) we showed the feasibility of using online RL to learn resource valuation estimates (… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
65
0
2

Year Published

2008
2008
2018
2018

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 97 publications
(70 citation statements)
references
References 26 publications
0
65
0
2
Order By: Relevance
“…In [29], a model based on queuing theory is corrected at runtime by exploiting online reinforcement learning to determine the batching level delivering lowest latencies for a Total-Order based broadcast primitive. A similar approach is undertaken in [30], where the target is optimizing resource provisioning in a distributed application and the online learner is based on Q-learning. In [31], [32] analytical models are complemented at runtime by decision tree regressors, in the former case with the purpose of optimizing the global multiprogramming level for distributed transactional applications, in the latter to allow a continuous validation and correction of difference performance predictors in a data center.…”
Section: Reconfiguration Managermentioning
confidence: 99%
“…In [29], a model based on queuing theory is corrected at runtime by exploiting online reinforcement learning to determine the batching level delivering lowest latencies for a Total-Order based broadcast primitive. A similar approach is undertaken in [30], where the target is optimizing resource provisioning in a distributed application and the online learner is based on Q-learning. In [31], [32] analytical models are complemented at runtime by decision tree regressors, in the former case with the purpose of optimizing the global multiprogramming level for distributed transactional applications, in the latter to allow a continuous validation and correction of difference performance predictors in a data center.…”
Section: Reconfiguration Managermentioning
confidence: 99%
“…While resource allocation is more concerned with low level scheduling of tasks at the virtual machine level, the parallels between them still merit their inclusion. Tesauro investigated the use of a hybrid reinforcement learning technique for autonomic resource allocation [15]. He applied this research to optimizing server allocation in data centers.…”
Section: Background Researchmentioning
confidence: 99%
“…To efficiently calculate those trajectories, we could again leverage machine learning techniques, in particular Reinforcement Learning (RL). RL has been successfully used for Autonomic Computing in the recent past to synthesize policies for decision-making, aiming once again at self-optimization [27] [28]. A key concept in RL is "reward", a scalar that valuates the observed consequence of a decision, and which the learning agent making decisions aims to maximize.…”
Section: Ongoing and Future Workmentioning
confidence: 99%