2022
DOI: 10.1016/j.trc.2022.103670
|View full text |Cite
|
Sign up to set email alerts
|

Learning the max pressure control for urban traffic networks considering the phase switching loss

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 29 publications
(5 citation statements)
references
References 20 publications
0
5
0
Order By: Relevance
“…FRAP [82] improved RL-based methods by adding a phase competition model. PressLight [83] incorporated the method of max pressure, which aggressively selects the traffic signal phase with maximum pressure to smooth congestion [84][85][86][87][88], into the reward design and shows improvement over traditional reward design. Chen et al [89] combined the idea from FRAP [82] and PressLight [83] to achieve city-level traffic signal control.…”
Section: Literature Reviewmentioning
confidence: 99%
“…FRAP [82] improved RL-based methods by adding a phase competition model. PressLight [83] incorporated the method of max pressure, which aggressively selects the traffic signal phase with maximum pressure to smooth congestion [84][85][86][87][88], into the reward design and shows improvement over traditional reward design. Chen et al [89] combined the idea from FRAP [82] and PressLight [83] to achieve city-level traffic signal control.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Traffic simulation tools, e.g., Vissim, SUMO, CityFlow, and Aimsun, were widely used to build virtual intersections. Vehicle demand level during training was unchangeable over time , or varied by time of day [13][14][15][16][17][18][19][33][34][35][36][37][38][39][40][41]. Vehicle turning ratios were typically fixed.…”
Section: Training Environmentsmentioning
confidence: 99%
“…To choose a vehicle phase to display green with a specific duration 18 [9,[11][12][13]15,21,[26][27][28][29][31][32][33][34]36,37,39,40] To choose the green time for current vehicle phase 4 [10,17,23,41] To determine whether or not to end current vehicle phase 8 [7,16,[18][19][20]24,25,38] To adjust the green time for all vehicle phases in next cycle 5 [8,14,22,30,35] Vehicle-specific performance measure used to construct rewards Number of already served vehicles 14 [12,13,17,18,[20][21][22][23]28,31,33,34,38,39] Wait time of already ...…”
Section: Action Taken By An Agentmentioning
confidence: 99%
“…The methods (Le et al., 2015; Levin et al., 2020) applied “cyclic phases” policy that arranges the phases into a fixed‐ordered sequence. In recent studies, the back‐pressure approach has been combined with model‐based optimal control methods or RL methods (D. Ma et al., 2020; Wang et al., 2022). A summary of significant publications on max‐pressure methods is presented in Table 1.…”
Section: Introductionmentioning
confidence: 99%