2022
DOI: 10.1016/j.eswa.2022.116830
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement learning in urban network traffic signal control: A systematic literature review

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 113 publications
(34 citation statements)
references
References 105 publications
0
18
0
Order By: Relevance
“…In particular, Rakelly et al (2019) developed a policy-based meta-RL model, and their action-value and policy functions were alternately updated based on off-policy data. The present study accepted the concept of this paper for traffic signal control on an areawide scale, but a value-based model was used instead of a policy-based model, since the former is more suitable for the traffic signal control on a large scale (Noaeen et al, 2022).…”
Section: Related Workmentioning
confidence: 80%
“…In particular, Rakelly et al (2019) developed a policy-based meta-RL model, and their action-value and policy functions were alternately updated based on off-policy data. The present study accepted the concept of this paper for traffic signal control on an areawide scale, but a value-based model was used instead of a policy-based model, since the former is more suitable for the traffic signal control on a large scale (Noaeen et al, 2022).…”
Section: Related Workmentioning
confidence: 80%
“…Real-time The algorithm provides near-optimal solutions with a maximum optimality gap of 5.4%. [19] Decentralized Reinforcement Learning at the Edge for traffic light control in the IoV (DRLE)…”
Section: Discussionmentioning
confidence: 99%
“…The DRL-based traffic control methods can be classified into two groups according to the traffic control mode: centralized control and decentralized control [20]. The centralized control methods usually have one RL agent which learns the optimal policy for the whole grid.…”
Section: Traffic Control Based On Rlmentioning
confidence: 99%