2002
DOI: 10.1007/3-540-36187-1_36

Learning to Reach the Pareto Optimal Nash Equilibrium as a Team

Cited by 11 publications (7 citation statements)
References 3 publications
“…Prior work in computer science has sought to answer this question by allowing communication between players. For example, in [50], convergence to the efficient outcome is attained if agents are able to agree on restricting their action sets. Also, in [51], agents are allowed to communicate with each other their understanding of the game structure.…”
Section: Introduction
Type: mentioning
confidence: 99%
“…This class of dynamics belongs to the general class of linear reward-inaction schemes and was considered first in mathematical psychology by [36] and introduced in engineering by [44]. It has also been applied in different forms in evolutionary economics, for modeling human or economic behavior [1,4,8,14,21], sociology, for modeling social network formation [46], and computer science, for learning how to reach the payoff-dominant equilibrium [50]. There is also discussion [4,8,21] on how this class of dynamics is related to replicator dynamics [31].…”
Section: Introduction
Type: mentioning
confidence: 99%
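For readers unfamiliar with the linear reward-inaction scheme named in the statement above, its textbook update rule can be written as follows. This is a sketch of the standard form, not a formula taken from the cited paper; the symbols $p_i$ (probability of action $i$), $\alpha$ (step size in $(0,1)$) and $r(t)$ (reward scaled to $[0,1]$) are notation assumed here for illustration.

```latex
% Agent selects action i at time t and receives reward r(t) \in [0,1].
\begin{aligned}
p_i(t+1) &= p_i(t) + \alpha\, r(t)\,\bigl(1 - p_i(t)\bigr), \\
p_j(t+1) &= p_j(t) - \alpha\, r(t)\, p_j(t), \qquad j \neq i .
\end{aligned}
```

When $r(t) = 0$ the probabilities are left unchanged, which is the "inaction" part of the name; the two lines together keep $\sum_k p_k = 1$.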
“…(9) Verbeeck et al suggest that individual rationality aligns with group rationality and leads to an optimal solution when a common goal is established and a primary care coordinator is chosen to minimize personal strain while maintaining high personal commitment. (11) Therefore, a single care coordinator is important to achieve a common care goal and to create some social pressure on the individual units within a network. (11) This is particularly true when actions of multi-agent networks are tied to limited resources such as time, daily routines and economical behaviour.…”
Section: Theoretical Approach
Type: mentioning
confidence: 99%
“…This FMQ heuristic takes into account how frequently an action produces its maximum corresponding reward. In Verbeeck et al (2002) a new exploration technique is used for coordination games. It is based on exploring, selfish RL (ESRL) agents, playing selfish for a period of time and then excluding actions from their private action space, so that the joint action space gets considerably smaller and the agents are able to converge to a Nash equilibrium of the remaining subgame.…”
Section: Exploration-exploitation Schemes For Independent Reinforceme…
Type: mentioning
confidence: 99%
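The explore-then-exclude idea described in the statement above can be illustrated with a small sketch. It is built only from that one-sentence description and is not the ESRL algorithm as published: the payoff matrix `PAYOFF`, the function names, the period length, the step size, and the choice of a linear reward-inaction learner for the "selfish" phase are all assumptions made for illustration.

```python
import random

# Hypothetical common-payoff coordination game (rewards scaled to [0, 1]).
PAYOFF = [
    [1.0, 0.0, 0.0],
    [0.0, 0.6, 0.0],
    [0.0, 0.0, 0.3],
]


def lri_update(probs, action, reward, step):
    """One linear reward-inaction step: reinforce `action` in proportion to reward."""
    return [
        p + step * reward * (1.0 - p) if i == action else p - step * reward * p
        for i, p in enumerate(probs)
    ]


def play_period(payoff, allowed_a, allowed_b, rng, rounds=2000, step=0.05):
    """Both agents learn independently over their currently allowed actions.

    Returns the joint action the agents favour at the end of the period
    and the payoff of that joint action.
    """
    pa = [1.0 / len(allowed_a)] * len(allowed_a)
    pb = [1.0 / len(allowed_b)] * len(allowed_b)
    for _ in range(rounds):
        i = rng.choices(range(len(allowed_a)), weights=pa)[0]
        j = rng.choices(range(len(allowed_b)), weights=pb)[0]
        reward = payoff[allowed_a[i]][allowed_b[j]]  # same reward for both agents
        pa = lri_update(pa, i, reward, step)
        pb = lri_update(pb, j, reward, step)
    a = allowed_a[max(range(len(pa)), key=pa.__getitem__)]
    b = allowed_b[max(range(len(pb)), key=pb.__getitem__)]
    return a, b, payoff[a][b]


def explore_then_exclude(payoff, seed=0):
    """Alternate selfish learning periods with exclusion of the favoured action.

    After each period every agent removes the action it settled on from its
    private action set, so the next period is played on a smaller subgame.
    The best joint action seen so far is tracked in one place for brevity;
    in a common-payoff game each agent could track it privately.
    """
    rng = random.Random(seed)
    allowed_a = list(range(len(payoff)))
    allowed_b = list(range(len(payoff[0])))
    best = None
    while allowed_a and allowed_b:
        a, b, reward = play_period(payoff, allowed_a, allowed_b, rng)
        if best is None or reward > best[2]:
            best = (a, b, reward)
        allowed_a.remove(a)
        allowed_b.remove(b)
    return best


if __name__ == "__main__":
    print(explore_then_exclude(PAYOFF))
```

Because learning is stochastic, a period can end with the two agents favouring mismatched actions, so this sketch is not guaranteed to return the payoff-dominant joint action; it only shows how exclusion shrinks the joint action space from one period to the next.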