2022
DOI: 10.48550/arxiv.2212.11498
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…Specifically, redundant computation, which implies that the same or similar computation is performed redundantly by different agents, is a well-known traditional problem in MASs [19] and comes from overlapping observations, leading to similar associated processes in several agents. This problem widely exists in applications such as distributed robot systems [20], distributed sensor networks [19,21] and distributed air traffic control [22], where there is a natural spatial distribution of information and where the agents have many overlapping observations. As shown in Figure 1a, the observations of the decentralized executing agents contain numerous overlapping regions.…”
Section: Introductionmentioning
confidence: 99%
“…Specifically, redundant computation, which implies that the same or similar computation is performed redundantly by different agents, is a well-known traditional problem in MASs [19] and comes from overlapping observations, leading to similar associated processes in several agents. This problem widely exists in applications such as distributed robot systems [20], distributed sensor networks [19,21] and distributed air traffic control [22], where there is a natural spatial distribution of information and where the agents have many overlapping observations. As shown in Figure 1a, the observations of the decentralized executing agents contain numerous overlapping regions.…”
Section: Introductionmentioning
confidence: 99%
“…Cooperative multi-agent reinforcement learning (MARL) jointly trains a team of agents to exhibit behaviour which maximises shared cumulative rewards. MARL can tackle problems such as autonomous driving (Shalev-Shwartz et al, 2016;Zhou et al, 2021) and warehouse logistics (Li et al, 2019;Krnjaic et al, 2022), but its real-world adaptation is still limited. Two remaining challenges of MARL are the large number of samples required to learn cooperation and the non-stationarity of the optimisation due to agents learning simultaneously (Papoudakis et al, 2019).…”
Section: Introductionmentioning
confidence: 99%