2021
DOI: 10.3390/e23040461
|View full text |Cite
|
Sign up to set email alerts
|

Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents

Abstract: The problem of finding adequate population models in ecology is important for understanding essential aspects of their dynamic nature. Since analyzing and accurately predicting the intelligent adaptation of multiple species is difficult due to their complex interactions, the study of population dynamics still remains a challenging task in computational biology. In this paper, we use a modern deep reinforcement learning (RL) approach to explore a new avenue for understanding predator-prey ecosystems. Recently, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 45 publications
0
1
0
Order By: Relevance
“…As a result, the pursuitevasion issue was converted into a zero-sum game addressed through minimax-Q learning. In predatory games, Park et al [17] set up a co-evolution framework for predator and prey to allow multiple agents to learn good policies by deep reinforcement learning. Gu et al [18] presented an attention-based fault-tolerant model, which could also be applied to pursuit-evasion games, and the key idea was to utilize the multihead attention mechanism to select the correct and useful information for estimating the critics.…”
Section: Introductionmentioning
confidence: 99%
“…As a result, the pursuitevasion issue was converted into a zero-sum game addressed through minimax-Q learning. In predatory games, Park et al [17] set up a co-evolution framework for predator and prey to allow multiple agents to learn good policies by deep reinforcement learning. Gu et al [18] presented an attention-based fault-tolerant model, which could also be applied to pursuit-evasion games, and the key idea was to utilize the multihead attention mechanism to select the correct and useful information for estimating the critics.…”
Section: Introductionmentioning
confidence: 99%