2022
DOI: 10.48550/arxiv.2207.07751
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

MARLAS: Multi Agent Reinforcement Learning for cooperated Adaptive Sampling

Abstract: The multi-robot adaptive sampling problem aims at finding trajectories for a team of robots to efficiently sample the phenomenon of interest within a given endurance budget of the robots. In this paper, we propose a robust and scalable approach using decentralized Multi-Agent Reinforcement Learning for cooperated Adaptive Sampling (MARLAS) of quasi-static environmental processes. Given a prior on the field being sampled, the proposed method learns decentralized policies for a team of robots to sample high-util… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 21 publications
0
2
0
Order By: Relevance
“…Wei and Zheng [28] have proposed an independent learning technique with credit assignment to solve this notoriously difficult problem. Pan, Manjanna, and Hsieh [29] have recently proposed a policy gradient-based DRL for multi-robot information sampling. Unlike the prior studies, they do not use GP as the underlying information inference tool.…”
Section: Problem Setupmentioning
confidence: 99%
“…Wei and Zheng [28] have proposed an independent learning technique with credit assignment to solve this notoriously difficult problem. Pan, Manjanna, and Hsieh [29] have recently proposed a policy gradient-based DRL for multi-robot information sampling. Unlike the prior studies, they do not use GP as the underlying information inference tool.…”
Section: Problem Setupmentioning
confidence: 99%
“…This class of approaches loads the communication channel to an even greater extent, but this disadvantage is compensated by the lower dependence of the problem solving process on the imposed maximum message transmission range constraints. Thus, in [23], a decentralized adaptive sampling approach based on reinforcement learning during the problem solving process is proposed. The strategy has been compared with the centralized DARP area division algorithm [24], taking into account the different numbers of mobile robots used and the communication range constraints.…”
Section: Introductionmentioning
confidence: 99%