2023
DOI: 10.1016/j.ymssp.2023.110698
|View full text |Cite
|
Sign up to set email alerts
|

Vehicle dynamic dispatching using curriculum-driven reinforcement learning

Xiaotong Zhang,
Gang Xiong,
Yunfeng Ai
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(1 citation statement)
references
References 27 publications
0
1
0
Order By: Relevance
“…Combining the performance of different algorithms across various intersections, it can be observed that the offline learning methods MFQ-ATSC and DQN exhibit faster convergence compared to the online learning methods MFAC-ATSC and A3C [19]. Transforming the multi-intersection traffic signal control problem within the region into interactions between individual intersections and neighboring intersections enhances the convergence speed of offline learning algorithms [20]. During the experience replay process, it allows for faster learning of the optimal policy from non-current strategy experiences [21] and leads to a more stable convergence of the optimal policy [22].…”
Section: Simulation Experiments Results Analysismentioning
confidence: 99%
“…Combining the performance of different algorithms across various intersections, it can be observed that the offline learning methods MFQ-ATSC and DQN exhibit faster convergence compared to the online learning methods MFAC-ATSC and A3C [19]. Transforming the multi-intersection traffic signal control problem within the region into interactions between individual intersections and neighboring intersections enhances the convergence speed of offline learning algorithms [20]. During the experience replay process, it allows for faster learning of the optimal policy from non-current strategy experiences [21] and leads to a more stable convergence of the optimal policy [22].…”
Section: Simulation Experiments Results Analysismentioning
confidence: 99%