2022
DOI: 10.1155/2022/3449433
|View full text |Cite
|
Sign up to set email alerts
|

Improving the Survival Time of Multiagents in Social Dilemmas through Neurotransmitter-Based Deep Q-Learning Model of Emotions

Abstract: In multiagent systems, social dilemmas often arise whenever there is a competition over the limited resources. The major challenge is to establish cooperation among intelligent virtual agents for solving the situations of social dilemmas. In humans, personality and emotions are the primary factors that lead them toward a cooperative environment. To make agents cooperate, they have to become more like humans, that is, believable. Therefore, we hypothesize that emotions according to the personality give birth to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 57 publications
(66 reference statements)
0
1
0
Order By: Relevance
“…The last decade has seen a method termed reinforcement learning, which comprises learning the optimal strategy for a situation from experience, attract a great deal of attention. Indeed, Deep Q-Learning (DQL) [9][10][11] is in active use, and is an extension of the Q-Learning method involving the learning of the expected gain (Q-value) in the future when 'a certain action' is taken in 'a certain situation' using Deep Neural Network (DNN) 10 . The idea of the Q-value is similar to that of the sum of discounted expected gains in conventional iterative games 12 .…”
Section: Introductionmentioning
confidence: 99%
“…The last decade has seen a method termed reinforcement learning, which comprises learning the optimal strategy for a situation from experience, attract a great deal of attention. Indeed, Deep Q-Learning (DQL) [9][10][11] is in active use, and is an extension of the Q-Learning method involving the learning of the expected gain (Q-value) in the future when 'a certain action' is taken in 'a certain situation' using Deep Neural Network (DNN) 10 . The idea of the Q-value is similar to that of the sum of discounted expected gains in conventional iterative games 12 .…”
Section: Introductionmentioning
confidence: 99%