David Milec scite author profile

David Milec

3 Publications

3 Citation Statements Received

55 Citation Statements Given


Affiliations

Czech Technical University in Prague

Publications


Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games

Milec, Černý, Lisý et al. 2020

Preprint


Solution concepts of traditional game theory assume entirely rational players; therefore, their ability to exploit subrational opponents is limited. One type of subrationality that describes human behavior well is the quantal response. While there exist algorithms for computing solutions against quantal opponents, they either do not scale or may provide strategies that are even worse than the entirely-rational Nash strategies. This paper aims to analyze and propose scalable algorithms for computing effective and robust strategies against a quantal opponent in normal-form and extensive-form games. Our contributions are: (1) we define two different solution concepts related to exploiting quantal opponents and analyze their properties; (2) we prove that computing these solutions is computationally hard; (3) therefore, we evaluate several heuristic approximations based on scalable counterfactual regret minimization (CFR); and (4) we identify a CFR variant that exploits the bounded opponents better than the previously used variants while being less exploitable by the worst-case perfectly-rational opponent.
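As an illustration of the quantal response model mentioned in the abstract, here is a minimal sketch assuming the common logit form, in which an opponent picks each action with probability proportional to the exponential of its expected utility. The abstract does not say which form the paper uses, and the names `quantal_response`, `column_utilities`, and `rationality` are hypothetical, chosen only for this example.

```python
import math


def quantal_response(utilities, rationality=1.0):
    """Logit quantal response (assumed form): P(a) is proportional to
    exp(rationality * u(a)). rationality = 0 gives uniform play; large
    values approach a best response."""
    top = max(utilities)  # subtract the max for numerical stability
    weights = [math.exp(rationality * (u - top)) for u in utilities]
    total = sum(weights)
    return [w / total for w in weights]


def column_utilities(payoffs, row_strategy):
    """Expected utility of each column action against a mixed row strategy.
    payoffs[i][j] is the column player's payoff (hypothetical convention)."""
    n_cols = len(payoffs[0])
    return [
        sum(p * row[j] for p, row in zip(row_strategy, payoffs))
        for j in range(n_cols)
    ]


# A quantal column player responding to a fixed row strategy in a 2x2 game.
payoffs = [[1.0, -1.0], [-1.0, 1.0]]
opponent_mix = quantal_response(column_utilities(payoffs, [0.8, 0.2]), 2.0)
```

Because the response is smooth in the utilities rather than a hard maximum, the row player can shift the opponent's action probabilities by changing its own strategy, which is what makes such an opponent exploitable.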


Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games

Milec, Černý, Lisý et al. 2021

AAAI


Fast Algorithms for Poker Require Modelling it as a Sequential Bayesian Game

Kovařík, Milec, Šustr et al. 2021

Preprint


Many recent results in imperfect information games were only formulated for, or evaluated on, poker and poker-like games such as liar's dice. We argue that sequential Bayesian games constitute a natural class of games for generalizing these results. In particular, this model allows for an elegant formulation of the counterfactual regret minimization algorithm, called public-state CFR (PS-CFR), which naturally lends itself to an efficient implementation. Empirically, solving a poker subgame with 10^7 states by public-state CFR takes 3 minutes and 700 MB while a comparable version of vanilla CFR takes 5.5 hours and 20 GB. Additionally, the public-state formulation of CFR opens up the possibility for exploiting domain-specific assumptions, leading to a quadratic reduction in asymptotic complexity (and a further empirical speedup) over vanilla CFR in poker and other domains. Overall, this suggests that the ability to represent poker as a Bayesian extensive game played a key role in the success of CFR-based methods. Finally, we extend public-state CFR to general extensive-form games, arguing that this extension enjoys some, but not all, of the benefits of the version for sequential Bayesian games.
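For readers unfamiliar with CFR, the following is a minimal sketch of regret matching, the per-decision update that vanilla CFR applies at each information set. It is not the public-state CFR described in the abstract, and the function name `regret_matching` is hypothetical, chosen only for this example.

```python
def regret_matching(cumulative_regrets):
    """Turn cumulative regrets into a strategy: play each action in
    proportion to its positive regret, or uniformly if none is positive."""
    positive = [max(r, 0.0) for r in cumulative_regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    n = len(cumulative_regrets)
    return [1.0 / n] * n


# Actions with regrets 1 and 3 share the probability mass; the action with
# negative regret is not played.
strategy = regret_matching([1.0, 3.0, -2.0])
```

In CFR this rule is applied repeatedly while regrets are accumulated over iterations, and the average of the resulting strategies is what converges toward an equilibrium in two-player zero-sum games.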


