2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2021
DOI: 10.1109/iros51168.2021.9635898
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Reinforcement Learning for Visibility-based Persistent Monitoring

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…Different from monitoring an underlying vector field of interest [6], [7], which is typically approached as generalized coverage problem or as a variant of the orienteering problem [8], [9], our agent aims to maximize information gain in the vicinity of the targets (i.e., minimizes the uncertainty over the true target positions) [10], [11]. Since this uncertainty grows over time for targets not in direct view of the agent, the agent has to leave some of the targets temporarily untracked to relocate others and then return to those as soon as possible to avoid losing them.…”
Section: Introductionmentioning
confidence: 99%
“…Different from monitoring an underlying vector field of interest [6], [7], which is typically approached as generalized coverage problem or as a variant of the orienteering problem [8], [9], our agent aims to maximize information gain in the vicinity of the targets (i.e., minimizes the uncertainty over the true target positions) [10], [11]. Since this uncertainty grows over time for targets not in direct view of the agent, the agent has to leave some of the targets temporarily untracked to relocate others and then return to those as soon as possible to avoid losing them.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, multi-agent reinforcement learning (MARL) methods have proven to be effective for policy searching in complex and high-dimensional multi-robot tasks and they have been explored for information gathering and coverage problems. Persistent monitoring and coverage with MARL involve defining equal utility for every state [12,13]. In the sampling problem, utilities are distributed unevenly, thus requiring nonuniform exploration.…”
Section: Introductionmentioning
confidence: 99%