2020
DOI: 10.48550/arxiv.2004.04000
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Learning from Learners: Adapting Reinforcement Learning Agents to be Competitive in a Card Game

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(14 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…To simulate players in the game, we must train reinforcement learning agents. Following the original design proposed by the Chef's Hat authors [22], we will implement and train two types of agents: one based on Deep Q-Learning (DQL) and another one based on Proximal Policy Optimization (PPO). We need to implement these agents as their learned behavior will be our baseline for our final evaluation.…”
Section: Methodology and Experimental Protocolmentioning
confidence: 99%
See 1 more Smart Citation
“…To simulate players in the game, we must train reinforcement learning agents. Following the original design proposed by the Chef's Hat authors [22], we will implement and train two types of agents: one based on Deep Q-Learning (DQL) and another one based on Proximal Policy Optimization (PPO). We need to implement these agents as their learned behavior will be our baseline for our final evaluation.…”
Section: Methodology and Experimental Protocolmentioning
confidence: 99%
“…DQL and PPO Playing Behavior on the Chef's Hat Card Game. In the same development wave, it was recently investigated the design and development of reinforcement learning agents to play the four players Chef's Hat competitive card game [22]. These agents were based on Deep Q-Learning (DQL) [23] and Proximal Policy Optimization (PPO) [24], and achieved success in learning how to win the game in different tasks: playing against random agents, self-play, and online adaptation towards the opponents.…”
Section: Related Workmentioning
confidence: 99%
“…As discussed by Barros et al [20], the learning agents learn different strategies when trained to play the Chef's Hat game.…”
Section: The Learning Agentsmentioning
confidence: 99%
“…We deploy our experiments onto the multiplayer Chef's Hat card game [19] as it offers a dynamic interaction between the players and simulates directly the realworld counterpart game. In the scenarios' baseline, four agents play against each other and their performance is measured directly based on how many games they win [20]. It is not possible, however, without an extensive manual observation of their action-selection pattern to explain their winning behavior while the game happens.…”
Section: Introductionmentioning
confidence: 99%
“…By exploring, a standard agent learns solely from the signals it receives from the environment. The RL approach has shown success in domains such as robotics [3,4], game-playing [5,6], and inventory management [7], among others.…”
Section: Introductionmentioning
confidence: 99%