Evolving small-board Go players using coevolutionary temporal difference learning with archives

Krawiec, Krzysztof; Jaśkowski, Wojciech; Szubert, Marcin

doi:10.2478/v10006-011-0057-3

Cited by 12 publications

(10 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In particular, we investigate how the size of the search space and the type of evaluation function influence the performance of evolutionary, coevolutionary, temporal difference learning (TDL) algorithms, and their hybrids introduced in our previous works [36], [15]. This allows us also to discuss the benefits of global and local search hybridization.…”

Section: Introductionmentioning

confidence: 99%

On Scalability, Generalization, and Hybridization of Coevolutionary Learning: A Case Study for Othello

Szubert

Jaśkowski

Krawiec

2013

IEEE Trans. Comput. Intell. AI Games

Self Cite

View full text Add to dashboard Cite

This study investigates different methods of learning to play the game of Othello. The main questions posed concern scalability of algorithms with respect to the search space size and their capability to generalize and produce players that fare well against various opponents. The considered algorithms represent strategies as -tuple networks, and employ self-play temporal difference learning (TDL), evolutionary learning (EL) and coevolutionary learning (CEL), and hybrids thereof. To assess the performance, three different measures are used: score against an a priori given opponent (a fixed heuristic strategy), against opponents trained by other methods (round-robin tournament), and against the top-ranked players from the online Othello League. We demonstrate that although evolutionary-based methods yield players that fare best against a fixed heuristic player, it is the coevolutionary temporal difference learning (CTDL), a hybrid of coevolution and TDL, that generalizes better and proves superior when confronted with a pool of previously unseen opponents. Moreover, CTDL scales well with the size of representation, attaining better results for larger -tuple networks. By showing that a strategy learned in this way wins against the top entries from the Othello League, we conclude that it is one of the best 1-ply Othello players obtained to date without explicit use of human knowledge.

show abstract

Section: Introductionmentioning

confidence: 99%

On Scalability, Generalization, and Hybridization of Coevolutionary Learning: A Case Study for Othello

Szubert

Jaśkowski

Krawiec

2013

IEEE Trans. Comput. Intell. AI Games

Self Cite

View full text Add to dashboard Cite

show abstract

“…Following other studies [16,6], we identify the computational effort with the number of games played in interactions among individuals.…”

Section: The Experimentsmentioning

confidence: 99%

Improving coevolution by random sampling

Jaśkowski

Liskowski

Szubert

et al. 2013

Proceedings of the 15th Annual Conference on Genetic and Evolutionary Computation

Self Cite

View full text Add to dashboard Cite

Recent developments cast doubts on the effectiveness of coevolutionary learning in interactive domains. A simple evolution with fitness evaluation based on games with random strategies has been found to generalize better than competitive coevolution. In an attempt to investigate this phenomenon, we analyze the utility of random opponents for one-and two-population competitive coevolution applied to learning strategies for the game of Othello. We show that if coevolution uses two-population setup and engages also random opponents, it is capable of producing equally good strategies as evolution with random sampling for the expected utility performance measure.To investigate the differences between analyzed methods, we introduce performance profile, a tool that measures the player's performance against opponents of various strength. The profiles reveal that evolution with random sampling produces players coping well with mediocre opponents, but playing relatively poorly against stronger ones. This finding explains why in the round-robin tournament, evolution with random sampling is one of the worst methods from all those considered in this study.

show abstract

“…Even though the archive mechanisms such as the Hall of Fame (HoF) [9,6,5] have been studied in the past and are known to maintain progress in an evolutionary arms race, they do not provide any specific guarantees in terms of convergence to the optimal solution. Moreover, the characteristics of archive's influence on evolving individuals is little known, which constitutes the main motivation for this study.…”

Section: Introductionmentioning

confidence: 99%

Quantitative analysis of the hall of fame coevolutionary archives

Liskowski

2013

Proceedings of the 15th Annual Conference Companion on Genetic and Evolutionary Computation

View full text Add to dashboard Cite

This paper provides an attempt to investigate the properties of the Hall of Fame archive in two-population competitive coevolution environment applied to the game of Othello. Using the measure of expected utility, a round-robin tournament and performance profiles, we show that coevolution can be biased towards playing better with stronger opponents if it is driven by interactions with the past champions kept in the archive, in addition to pure competition among coevolving individuals. Moreover, the Hall of Fame does not necessarily influence the overall perfromance in terms of expected utility, as it trades-off the ability to cope with opponents of various strength, so that the resulting players are more likely to win with a strong opponent than with a weak one.

show abstract

Evolving small-board Go players using coevolutionary temporal difference learning with archives

Cited by 12 publications

References 32 publications

On Scalability, Generalization, and Hybridization of Coevolutionary Learning: A Case Study for Othello

On Scalability, Generalization, and Hybridization of Coevolutionary Learning: A Case Study for Othello

Improving coevolution by random sampling

Quantitative analysis of the hall of fame coevolutionary archives

Contact Info

Product

Resources

About