2012
DOI: 10.1177/1059712312449547
|View full text |Cite
|
Sign up to set email alerts
|

Rewards for pairs of Q-learning agents conducive to turn-taking in medium-access games

Abstract: We describe a class of stateful games, which we call 'medium-access games', as a model for human and machine communication and demonstrate how to use the Nash equilibria of those games as played by pairs of agents with stationary policies to predict turn-taking behaviour in Q-learning agents based on the agents' reward function. We identify which fixed policies exhibit turn-taking behaviour in medium-access games and show how to compute the Nash equilibria of such games by using Markov chain methods to calcula… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 40 publications
0
1
0
Order By: Relevance
“…erratic, passive, and aggressive) such that a learning-agent can learn to collaboratively build towers. Our behavior model is able to accurately predict the type of turn-taking action other agents will take, and a Q-learning agent can use our behavioral model to learn a turn-taking policy [16] that can be used in interactions involving varying numbers of agents.…”
Section: Introductionmentioning
confidence: 99%
“…erratic, passive, and aggressive) such that a learning-agent can learn to collaboratively build towers. Our behavior model is able to accurately predict the type of turn-taking action other agents will take, and a Q-learning agent can use our behavioral model to learn a turn-taking policy [16] that can be used in interactions involving varying numbers of agents.…”
Section: Introductionmentioning
confidence: 99%