An Improved Fitness Evaluation Mechanism with Memory in Spatial Prisoner's Dilemma Game on Regular Lattices

Wang, Juan; Liu, Lina; Dong, Enzeng; Li, Wang

doi:10.1088/0253-6102/59/3/02

Cited by 10 publications

(2 citation statements)

References 49 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For any game matrix, IP 0 will still identify other players’ strategies and maximize the difference in stationary payout. Information players should fare well in a variety of other contexts, including asymmetric games and population games on graphs, time-averaged fitness [ 4 ], and increased interaction neighborhood size on regular lattices [ 5 ].…”

Section: Discussionmentioning

confidence: 99%

“…The Prisoner’s Dilemma (PD) [ 1 ] is a two player game with a long history of study in evolutionary game theory [ 2 ] and finite populations [ 3 ]. Work on time-averaged fitness [ 4 ] and interaction neighborhood size on regular lattices [ 5 ], is of particular interest. Payoffs for the Prisoner’s Dilemma are usually defined via a game matrix

with T > R > P > S and often 2 R > T + S .…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

The Art of War: Beyond Memory-one Strategies in Population Games

2015

View full text Add to dashboard Cite

We show that the history of play in a population game contains exploitable information that can be successfully used by sophisticated strategies to defeat memory-one opponents, including zero determinant strategies. The history allows a player to label opponents by their strategies, enabling a player to determine the population distribution and to act differentially based on the opponent’s strategy in each pairwise interaction. For the Prisoner’s Dilemma, these advantages lead to the natural formation of cooperative coalitions among similarly behaving players and eventually to unilateral defection against opposing player types. We show analytically and empirically that optimal play in population games depends strongly on the population distribution. For example, the optimal strategy for a minority player type against a resident TFT population is ALLC, while for a majority player type the optimal strategy versus TFT players is ALLD. Such behaviors are not accessible to memory-one strategies. Drawing inspiration from Sun Tzu’s the Art of War, we implemented a non-memory-one strategy for population games based on techniques from machine learning and statistical inference that can exploit the history of play in this manner. Via simulation we find that this strategy is essentially uninvadable and can successfully invade (significantly more likely than a neutral mutant) essentially all known memory-one strategies for the Prisoner’s Dilemma, including ALLC (always cooperate), ALLD (always defect), tit-for-tat (TFT), win-stay-lose-shift (WSLS), and zero determinant (ZD) strategies, including extortionate and generous strategies.

show abstract

Section: Discussionmentioning

confidence: 99%

with T > R > P > S and often 2 R > T + S .…”

Section: Introductionmentioning

confidence: 99%