Evolving opponent models for Texas Hold 'Em

Lockett, Alan J.; Miikkulainen, Risto

doi:10.1109/cig.2008.5035618

Cited by 10 publications

(6 citation statements)

References 3 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Neuroevolution can be used in the specific role of predicting opponent strategy, as part of a player whose other parts might or might not be based on neuroevolution. Lockett and Miikkulainen evolved networks that could predict the other player's strategy in Texas Hold'em Poker, increasing the win rate of agents that used the model [66].…”

Section: Modelling Opponent Strategymentioning

confidence: 99%

Neuroevolution in Games: State of the Art and Open Challenges

Risi

Togelius

2017

IEEE Trans. Comput. Intell. AI Games

123

View full text Add to dashboard Cite

This paper surveys research on applying neuroevolution (NE) to games. In neuroevolution, artificial neural networks are trained through evolutionary algorithms, taking inspiration from the way biological brains evolved. We analyse the application of NE in games along five different axes, which are the role NE is chosen to play in a game, the different types of neural networks used, the way these networks are evolved, how the fitness is determined and what type of input the network receives.

show abstract

Section: Modelling Opponent Strategymentioning

confidence: 99%

Neuroevolution in Games: State of the Art and Open Challenges

Risi

Togelius

2017

IEEE Trans. Comput. Intell. AI Games

123

View full text Add to dashboard Cite

show abstract

“…Texas Hold ‘Em poker is a classic game used for studying learning mechanisms, especially for trying to predict opponents’ strategies and to play based on their model (Billings, Papp, Schaeffer, & Szafron, 1998; Ganzfried & Sandholm, 2011; Lockett & Miikkulainen, 2008). Lockett and Miikulainen (2008) used a coarse approximation to game-theoretic agents representations to improve the performance of agents in playing a Limit Texas Hold ‘Em poker by using opponent models initialized with a diverse mix of several parameters in order to develop agents using different strategies. The Deviation-Based Best Response (DBBR) algorithm has been developed by Ganzfriend and Sandholm (2011), for opponent modelling in large extensive-form games of imperfect information.…”

Section: An Overview Of Background and Related Workmentioning

confidence: 99%

Synthetic learning agents in game-playing social environments

Kiourt

Kalles

2016

Adaptive Behavior

View full text Add to dashboard Cite

This paper investigates the performance of synthetic agents in playing and learning scenarios in a turn-based zero-sum game and highlights the ability of opponent-based learning models to demonstrate competitive playing performances in social environments. Synthetic agents are generated based on a variety of combinations of some key parameters, such as exploitation-vs-exploration trade-off, learning backup and discount rates, and speed of learning, and interact over a very large number of games on a grid infrastructure; experimental data is then analysed to generate clusters of agents that demonstrate interesting associations between eventual performance ranking and learning parameters' setup. The evolution of these clusters indicates that agents with a predisposition to knowledge exploration and slower learning tend to perform better than exploiters, which tend to prefer fast learning. Observing these clusters vis-à-vis the playing behaviours of the agents makes it also possible to investigate how to select opponents best from a group; initial results suggest that good progress and stable evolution arise when an agent faces opponents of increasing capacity, and that an agent with a good learning mechanism setup progresses better when it faces less favourably setup agents.

show abstract

“…Perhaps the most interesting of these look at neuroevolution models, which are trained on opponents' behaviour on prior hands to predict when certain players might be playing suboptimally [4,8]. Impressively, even for a low-dimensional parameter space, some of these models have claimed on average 60% of table winnings when tested for a two-player game [9]. In addition to these quantitative studies, a substantial volume of work has been published in popular psychology [11,12], although it remains unclear whether many of the qualitative arguments are supported by quantitative evidence.…”

Section: Introductionmentioning

confidence: 99%

A Simulation Study of Texas Hold ’em Poker: What Taylor Swift Understands and James Bond Doesn’t

Falletta¹,

Woodcock²

2018

ANZIAM J.

View full text Add to dashboard Cite

Recent years have seen a large increase in the popularity of Texas hold ’em poker. It is now the most commonly played variant of the game, both in casinos and through online platforms. In this paper, we present a simulation study for games of Texas hold ’em with between two and 23 players. From these simulations, we estimate the probabilities of each player having been dealt the winning hand. These probabilities are calculated conditional on both partial information (that is, the player only having knowledge of his/her cards) and also on fuller information (that is, the true probabilities of each player winning given knowledge of the cards dealt to each player). Where possible, our estimates are compared to exact analytic results and are shown to have converged to three significant figures.With these results, we assess the poker strategies described in two recent pieces of popular culture. In comparing the ideas expressed in Taylor Swift’s song, New Romantics, and the betting patterns employed by James Bond in the 2006 film, Casino Royale, we conclude that Ms Swift demonstrates a greater understanding of the true probabilities of winning a game of Texas hold ’em poker.

show abstract

Evolving opponent models for Texas Hold 'Em

Cited by 10 publications

References 3 publications

Neuroevolution in Games: State of the Art and Open Challenges

Neuroevolution in Games: State of the Art and Open Challenges

Synthetic learning agents in game-playing social environments

A Simulation Study of Texas Hold ’em Poker: What Taylor Swift Understands and James Bond Doesn’t

Contact Info

Product

Resources

About