1995
DOI: 10.1613/jair.121
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Load Balancing: A Study in Multi-Agent Learning

Abstract: We study the process of multi-agent reinforcement learning in the context of load balancing in a distributed system, without use of either central coordination or explicit communication. We rst de ne a precise framework in which to study adaptive load balancing, important features of which are its stochastic nature and the purely local information available to individual agents. Given this framework, we show illuminating results on the interplay b e t ween basic adaptive behavior parameters and their e ect on … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
85
1
7

Year Published

2001
2001
2015
2015

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 133 publications
(97 citation statements)
references
References 31 publications
4
85
1
7
Order By: Relevance
“…Solipsistic Agents: MAS's with solipsistic agents have been successfully applied to 9 a multitude of problems 60,112,122,227,233]. Generally, these schemes use RL algorithms similar to those discussed in Section 3.1.1.…”
Section: Reinforcement Learning-based Multi-agent Systemsmentioning
confidence: 99%
“…Solipsistic Agents: MAS's with solipsistic agents have been successfully applied to 9 a multitude of problems 60,112,122,227,233]. Generally, these schemes use RL algorithms similar to those discussed in Section 3.1.1.…”
Section: Reinforcement Learning-based Multi-agent Systemsmentioning
confidence: 99%
“…(3) The exploration rate, erate, which determines the probability with which the Agent will ignore its trust ratings and send a bid to a random SCP Agent, where 0 represents no exploration of the market and 1.0 causes the SSP Agent to always select suppliers at random. The exploration rate parameter addresses a practical problem familiar in the reinforcement learning paradigm, that of balancing the advantages to be gained from trading with known and already trusted partners with the opportunity to discover better partners from the larger pool ( [21]). …”
Section: Ssp Allocator Functionmentioning
confidence: 99%
“…The vast majority of the work in this eld has focused on making agents more knowledgeable and able. This has been achieved in several ways: by giving the deliberative a g e n t a deeper knowledge base and ability to reason about data 21], giving it the ability to plan actions 26], negotiate with other agents 20], or change its strategies in response to actions of other agents 23,8]. At the opposite end of the spectrum lie agent-based systems that demonstrate complex group behavior, but whose individual elements are very simple.…”
Section: Individual Vs Emergent Complexitymentioning
confidence: 99%