The Distributed Kolkata Paise Restaurant Game

Kastampolidou, Kalliopi; Papalitsas, Christos; Andronikos, Theodore

doi:10.3390/g13030033

Cited by 5 publications

(3 citation statements)

References 48 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The Kolkata Paise Restaurant Problem (KPRP) was first introduced in 2007 [1] during work on the Kolkata Paise Hotel Problem. Since then, it has been studied extensively [1][2][3][4][5][6][7][8][9][10][11][12][13][14][15][16][17] in the econophysics literature. In its simplest form, we assume N ≫ 1 agents will choose among N restaurants.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Stability of Dining Clubs in the Kolkata Paise Problem with and without Cheating

Akshat¹,

Belmonte²,

Griffin³

2023

Preprint

View full text Add to dashboard Cite

We introduce the idea of a dining club to the Kolkata Paise Restaurant Problem. In this problem, N agents choose (randomly) among N restaurants, but if multiple agents choose the same restaurant, only one will eat. Agents in the dining club will coordinate their restaurant choice to avoid choice collision and increase their probability of eating. We model the problem of deciding whether to join the dining club as an evolutionary game and show that the strategy of joining the dining club is evolutionarily stable. We then introduce an optimized member tax to those individuals in the dining club, which is used to provide a safety net for those group members who don't eat because of collision with a non-dining club member. When non-dining club members are allowed to cheat and share communal food within the dining club, we show that a new unstable fixed point emerges in the dynamics. A bifurcation analysis is performed in this case. To conclude our theoretical study, we then introduce evolutionary dynamics for the cheater population and study these dynamics. Numerical experiments illustrate the behaviour of the system with more than one dining club and show several potential areas for future research.

show abstract

Section: Introductionmentioning

confidence: 99%

“…Quantum versions of the problem are considered in [12,15,16] and its relevance to other areas of physical modelling are considered in [8,10,14,17] with phase transitions considered recently in [2,9]. Distributed and coordinated solutions to optimizing agent payoff are discussed in [4][5][6]13].…”

Section: Introductionmentioning

confidence: 99%

Stability of Dining Clubs in the Kolkata Paise Problem with and without Cheating

Akshat¹,

Belmonte²,

Griffin³

2023

Preprint

View full text Add to dashboard Cite

show abstract

“…The multi-agent framework is adopted to solve the problem. Researchers in various fields have tried to extend the existing single-agent to multi-agent [24][25][26], such as Modular Q-Learning in which a single agent problem is divided into different subproblems, and each agent solves different subproblems, Ant Q-Learning of which all the agents share reward, and Nash Q-Learning which has greatly improved the efficiency of Q-Learning algorithms [27][28][29]. In this paper, the training process is completed by multi-agent parallel mode, and the optimal maintenance policy of the bridge is output by calculating the return of the whole structure.…”

Section: Introductionmentioning

confidence: 99%

An Advanced Multi-Agent Reinforcement Learning Framework of Bridge Maintenance Policy Formulation

Zhou

Yuan²,

Yang

et al. 2022

Sustainability

View full text Add to dashboard Cite

In its long service life, bridge structure will inevitably deteriorate due to coupling effects; thus, bridge maintenance has become a research hotspot. The existing algorithms are mostly based on linear programming and dynamic programming, which have low efficiency and high economic cost and cannot meet the actual needs of maintenance. In this paper, a multi-agent reinforcement learning framework was proposed to predict the deterioration process reasonably and achieve the optimal maintenance policy. Using the regression-based optimization method, the Markov transition matrix can better describe the uncertain transition process of bridge components in the maintenance year and the real-time updating of the matrix can be realized by monitoring and evaluating the performance deterioration of components. Aiming at bridges with a large number of components, the maintenance decision-making framework of multi-agent reinforcement learning can adjust the maintenance policy according to the updated Markov matrix in time, which can better adapt to the dynamic change of bridge performance in service life. Finally, the effectiveness of the framework was verified by taking the simulation data of a simply supported beam bridge and a cable-stayed bridge as examples.

show abstract

Achieving maximum utilization in optimal time for learning or convergence in the Kolkata Paise Restaurant problem

Biswas,

Sinha,

Chakrabarti

2024

Indian J Phys

View full text Add to dashboard Cite

The Distributed Kolkata Paise Restaurant Game

Cited by 5 publications

References 48 publications

Stability of Dining Clubs in the Kolkata Paise Problem with and without Cheating

Stability of Dining Clubs in the Kolkata Paise Problem with and without Cheating

An Advanced Multi-Agent Reinforcement Learning Framework of Bridge Maintenance Policy Formulation

Achieving maximum utilization in optimal time for learning or convergence in the Kolkata Paise Restaurant problem

Contact Info

Product

Resources

About