Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022
DOI: 10.24963/ijcai.2022/76
|View full text |Cite
|
Sign up to set email alerts
|

Modelling the Dynamics of Regret Minimization in Large Agent Populations: a Master Equation Approach

Abstract: Understanding the learning dynamics in multiagent systems is an important and challenging task. Past research on multi-agent learning mostly focuses on two-agent settings. In this paper, we consider the scenario in which a population of infinitely many agents apply regret minimization in repeated symmetric games. We propose a new formal model based on the master equation approach in statistical physics to describe the evolutionary dynamics in the agent population. Our model takes the form of a partial differen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9

Relationship

2
7

Authors

Journals

citations
Cited by 71 publications
(7 citation statements)
references
References 1 publication
0
7
0
Order By: Relevance
“…(4) One Monte Carlo step is characterized by repeating procedures (2) and (3) for N times. (5) The steady state of the system is averaged over the last 2000 steps of the overall 20 000 steps. Moreover, the final results have been averaged over 10 independent runs to eliminate the effect of some uncertainties.…”
Section: Modelmentioning
confidence: 99%
See 1 more Smart Citation
“…(4) One Monte Carlo step is characterized by repeating procedures (2) and (3) for N times. (5) The steady state of the system is averaged over the last 2000 steps of the overall 20 000 steps. Moreover, the final results have been averaged over 10 independent runs to eliminate the effect of some uncertainties.…”
Section: Modelmentioning
confidence: 99%
“…The efficiency of multi-agent systems heavily relies on the cooperation among agents [1,2]. Over the past few years, the emergence of cooperative behaviors in multi-agent systems has been a prominent research topic [3][4][5]. Evolutionary game theory [6,7] provides a framework to model and simulate the evolution of behaviors in multi-agent system, where each individual in the game is treated as an agent, and the behavior evolution is achieved by interacting and learning with other agents.…”
Section: Introductionmentioning
confidence: 99%
“…In competitive environments, two-player zero-sum games are a fruitful area [10][11][12][13][14] . The optimization goal of the Counterfactual Regret Minimization (CFR) algorithm [19,20] matches the Nash Equilibrium and has worstcase guarantees. Reinforcement learning algorithms empower agents to master complex strategies from scratch self-play [21] .…”
Section: Related Workmentioning
confidence: 99%
“…Consequently, the system is ultimately dominated by defectors. To overcome this dilemma, many effective mechanisms are proposed to promote cooperation as the strategy of an individual is either cooperation or defection (hereafter called discrete strategy), for example, network structures [9,10], memory [11,12], aspiration [13,14], age structure [15,16], reputation [17,18], asymmetry [19][20][21], reward and punishment [22][23][24] and regret minimization [25].…”
Section: Introductionmentioning
confidence: 99%