2023
DOI: 10.32604/csse.2023.031116
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Dynamic Area Coverage Based on Reinforcement Learning with Connected Agents

Abstract: Dynamic area coverage with small unmanned aerial vehicle (UAV) systems is one of the major research topics due to limited payloads and the difficulty of decentralized decision-making process. Collaborative behavior of a group of UAVs in an unknown environment is another hard problem to be solved. In this paper, we propose a method for decentralized execution of multi-UAVs for dynamic area coverage problems. The proposed decentralized decision-making dynamic area coverage (DDMDAC) method utilizes reinforcement … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 45 publications
0
2
0
1
Order By: Relevance
“…PPO is the base DRL algorithm in this paper. Aydemir and Cetin [ 159 ] proposed a distributed system for the multi-UAV coverage in partially observable environments using DRL. Only the nearby robots share their state information and observations with each other.…”
Section: Multi-robot System Applications Of Multi-agent Deep Reinforc...mentioning
confidence: 99%
“…PPO is the base DRL algorithm in this paper. Aydemir and Cetin [ 159 ] proposed a distributed system for the multi-UAV coverage in partially observable environments using DRL. Only the nearby robots share their state information and observations with each other.…”
Section: Multi-robot System Applications Of Multi-agent Deep Reinforc...mentioning
confidence: 99%
“…Therefore, a relatively small algebraic connectivity is needed when forming a formation for the follower USVs. In addition, algebraic connectivity can also reflect the convergence rate of the collaborative multi-USV system [27,28]. When the formation of follower USVs guarantees relative dispersion and safety, it is best to have the fastest possible convergence rate.…”
Section: Introductionmentioning
confidence: 99%
“…İHA'ların dağıtık olarak çalışmasına izin verilen bu yöntemde, hedef alanda İHA'lar arasındaki bağlantının kurulması da amaçlanmıştır. Aydemir ve Çetin [25], çoklu İHA kullanarak dinamik ortamda alan kapsamayı en yükseğe çıkarmaya çalışmıştır. Derin pekiştirmeli öğrenme ile modellenmiş ajanlar merkezi bir modül ile öğrenme gerçekleştirir; ancak yürütme aşamasında merkezi modül devreden çıkarılır.…”
Section: Giriş (Introduction)unclassified