2022
DOI: 10.1609/aaai.v36i9.21236
|View full text |Cite
|
Sign up to set email alerts
|

MAPDP: Cooperative Multi-Agent Reinforcement Learning to Solve Pickup and Delivery Problems

Abstract: Cooperative Pickup and Delivery Problem (PDP), as a variant of the typical Vehicle Routing Problems (VRP), is an important formulation in many real-world applications, such as on-demand delivery, industrial warehousing, etc. It is of great importance to efficiently provide high-quality solutions of cooperative PDP. However, it is not trivial to provide effective solutions directly due to two major challenges: 1) the structural dependency between pickup and delivery pairs require explicit modeling and represent… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(6 citation statements)
references
References 13 publications
0
6
0
Order By: Relevance
“…With the emergence of deep learning, numerous NN-based model have been proposed in recent years to solve VRPs (Fu, Qiu, and Zha 2021;Zong et al 2022;Zhang et al 2022). These models can be generally categorized into neural improvement type and neural construction type.…”
Section: Related Workmentioning
confidence: 99%
“…With the emergence of deep learning, numerous NN-based model have been proposed in recent years to solve VRPs (Fu, Qiu, and Zha 2021;Zong et al 2022;Zhang et al 2022). These models can be generally categorized into neural improvement type and neural construction type.…”
Section: Related Workmentioning
confidence: 99%
“…Reinforcement Learning (RL) has witnessed a great number of successes in combinatorial optimization (CO) (Bengio, Lodi, and Prouvost 2021;Woo, Lee, and Cho 2022;Chen and Tian 2019;Zong et al 2022;Bai et al 2021), robotics (Tomar, Sathuluri, and Ravindran 2019), natural language processing (Li, Kiseleva, and De Rijke 2019) and computer vision (Kim et al 2021). Combination of RL and graph neural network also flourished in recent years (Vesselinova et al 2020).…”
Section: Puzzle Reassembly Strategiesmentioning
confidence: 99%
“…In ref. [169], the multi‐agent RL was explored to solve PDTSP with capacity constraint. In a recent work by Ma et al.…”
Section: Vehicle Routing Problemsmentioning
confidence: 99%
“…In ref. [169], the multi-agent RL was explored to solve PDTSP with capacity constraint. In a recent work by Ma et al [72], the neural neighbourhood search (N2S) framework was proposed which was the first learning-based solver that outstrips the well-known LKH3 solver [176] on solving PDTSP and its variant with Last-In-Fist-Out (LIFO) constraint on the synthesised dataset.…”
Section: Classical Routing Problemsmentioning
confidence: 99%