1997
DOI: 10.1007/s004220050351
|View full text |Cite
|
Sign up to set email alerts
|

Specialization in multi-agent systems through learning

Abstract: Specialization is a common feature in animal societies that leads to an improvement in the fitness of the team members and to an increase in the resources obtained by the team. In this paper we propose a simple reinforcement learning approach to specialization in an artificial multi-agent system. The system is composed of homogeneous and non-communicating agents. Because there is no communication, the number of agents in the team can easily scale up. Agents have the same initial functionalities, but they learn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2002
2002
2014
2014

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(19 citation statements)
references
References 10 publications
0
19
0
Order By: Relevance
“…Optimization algorithms allowing controller differentiation or using local reinforcement signals face a daunting credit assignment problem, because it is difficult to determine which actions of which agent are responsible for the outcomes observed in the system. Learning under these conditions has been addressed with the help of explicit communication, 34 a priori information about proper task completion, 35 and careful alignment of individual and group performance metrics, 36,37 although none of these approaches can be easily applied to the problem of plume traversal. Extensive peer-to-peer communication is undesirable, because the overhead of providing and maintaining each agent's unique identification, as well as a possibly exponentially increasing number of messages, makes it difficult to scale to large group sizes.…”
Section: Off-line Machine Learning Optimizationmentioning
confidence: 99%
“…Optimization algorithms allowing controller differentiation or using local reinforcement signals face a daunting credit assignment problem, because it is difficult to determine which actions of which agent are responsible for the outcomes observed in the system. Learning under these conditions has been addressed with the help of explicit communication, 34 a priori information about proper task completion, 35 and careful alignment of individual and group performance metrics, 36,37 although none of these approaches can be easily applied to the problem of plume traversal. Extensive peer-to-peer communication is undesirable, because the overhead of providing and maintaining each agent's unique identification, as well as a possibly exponentially increasing number of messages, makes it difficult to scale to large group sizes.…”
Section: Off-line Machine Learning Optimizationmentioning
confidence: 99%
“…Such approaches have become popular in collective behavior task domains where one does not know, a priori, the degree of specialization required to optimally solve the given task (Stanley, Bryant, & Miikkulainen, 2005b), (Waibel, Floreano, Magnenat, & Keller, 2006), (Gautrais, Theraulaz, Deneubourg, & Anderson, 2002), (Potter et al, 2001), (Luke & Spector, 1996), (Theraulaz, Bonabeau, & Deneubourg, 1998b), , (Murciano, Millan, & Zamora, 1997).…”
Section: Emergent Specializationmentioning
confidence: 99%
“…Multirobot learning using several methods in a wide variety of scenarios has been explored [2], [23]. Specialization in multiagent systems using reinforcement learning was studied in [18]. Techniques for increasing individual learning speed via multi-robot learning were studied in [8] and [14].…”
Section: Introductionmentioning
confidence: 99%