“…One such example is Kaplan, Sauer, and Sosa (2017), who combined natural language methods with deep reinforcement learning to play Atari games. More recent work applying transformer models to reinforcement learning includes Parisotto et al. (2020); Noever, Ciolino, and Kalin (2020), who trained GPT-2 (Radford et al. 2019) on games in PGN format to learn chess; Ciolino, Kalin, and Noever (2020), who trained GPT-2 in a similar way to learn Go; and Stein, Filchenkov, and Asadulaev (2020), who used transformers for deep Q-learning to play Atari games. Krause et al. (2020) introduced a coding scheme to improve small language models.…”