Analysis of Minimax Algorithm Using Tic-Tac-Toe

Swaminathan, Bhuvaneswari; Vaishali, R.; Subashri, T.

doi:10.3233/apc200197

Intelligent Systems and Computer Technology

2020

DOI: 10.3233/apc200197

|View full text |Cite

Analysis of Minimax Algorithm Using Tic-Tac-Toe

Bhuvaneswari Swaminathan¹,

R. Vaishali²,

T. Subashri³

Abstract: The game industry has been on exponential growth, has different businesses of varying size, ethos, scope and beyond. Success of these video-games comes from a lot of labor-intensive work by developers. Every little nuance of each character, the objects within a character’s environment must be hand-coded. Repetitive work takes up a significant part of development time, which leads to an increase in glitches and logical flaws. Artificial intelligence has been used to simulate human players in software games, pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2022

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 5 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Hybrid Training Strategies: Improving Performance of Temporal Difference Learning in Board Games

Fernández-Conde¹,

Cuenca-Jimenez²,

Plaza³

2022

Applied Sciences

View full text Add to dashboard Cite

Temporal difference (TD) learning is a well-known approach for training automated players in board games with a limited number of potential states through autonomous play. Because of its directness, TD learning has become widespread, but certain critical difficulties must be solved in order for it to be effective. It is impractical to train an artificial intelligence (AI) agent against a random player since it takes millions of games for the agent to learn to play intelligently. Training the agent against a methodical player, on the other hand, is not an option owing to a lack of exploration. This article describes and examines a variety of hybrid training procedures for a TD-based automated player that combines randomness with specified plays in a predetermined ratio. We provide simulation results for the famous tic-tac-toe and Connect-4 board games, in which one of the studied training strategies significantly surpasses the other options. On average, it takes fewer than 100,000 games of training for an agent taught using this approach to act as a flawless player in tic-tac-toe.

show abstract

Hybrid Training Strategies: Improving Performance of Temporal Difference Learning in Board Games

Fernández-Conde¹,

Cuenca-Jimenez²,

Plaza³

2022

Applied Sciences

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Analysis of Minimax Algorithm Using Tic-Tac-Toe

Cited by 1 publication

References 5 publications

Hybrid Training Strategies: Improving Performance of Temporal Difference Learning in Board Games

Hybrid Training Strategies: Improving Performance of Temporal Difference Learning in Board Games

Contact Info

Product

Resources

About