2023
DOI: 10.1109/access.2023.3237420
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for the Co-Optimization of Vehicular Flow Direction Design and Signal Control Policy for a Road Network

Abstract: Reinforcement Learning (RL) is a popular approach for deciding on an optimum traffic signal control policy to alleviate congestion in a road network. However, the traffic signal control policy can also be optimized in conjunction with the design of vehicular flow directions to further improve traffic performance. The design of vehicular flow directions refers to the right of way or directional restriction imposed in a road network. Here, a new RL-based technique is presented for co-optimization of the design o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 44 publications
0
2
0
Order By: Relevance
“…The cyclic phase model, put forth by Ibrokhimov et al [17], increases the network's adaptability by using the current phase, duration, and bias value of each phase as the state space. Zhao et al [18] proposed an RL-based cooperative technique for traffic direction and signal control. Its state space consists of vehicle distance, number of vehicles, and turn signals.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The cyclic phase model, put forth by Ibrokhimov et al [17], increases the network's adaptability by using the current phase, duration, and bias value of each phase as the state space. Zhao et al [18] proposed an RL-based cooperative technique for traffic direction and signal control. Its state space consists of vehicle distance, number of vehicles, and turn signals.…”
Section: Related Workmentioning
confidence: 99%
“…where P(s) denotes the probability of s in the training sample, θ is a parameter in the main network, and θ − in the target network is updated according to Equation (18).…”
Section: Double Dueling Deep Q-networkmentioning
confidence: 99%