2022
DOI: 10.1109/tits.2021.3091014
|View full text |Cite
|
Sign up to set email alerts
|

Using Reinforcement Learning to Control Traffic Signals in a Real-World Scenario: An Approach Based on Linear Function Approximation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 19 publications
(6 citation statements)
references
References 28 publications
0
4
0
Order By: Relevance
“…Although SUMO is popular among researchers and practitioners in the industry, another software called Multi-Agent Transport Simulation (MATSim) 7 is often used in academic research. While SUMO focuses on macroscopic traffic flow modeling, MATSim uses an agent-based approach to model individual travel behavior [145]. As a result, MATSim can capture more complex individual decision processes, while SUMO is better suited for overall traffic flow modeling.…”
Section: ) Mobility and Autonomous Vehiclesmentioning
confidence: 99%
“…Although SUMO is popular among researchers and practitioners in the industry, another software called Multi-Agent Transport Simulation (MATSim) 7 is often used in academic research. While SUMO focuses on macroscopic traffic flow modeling, MATSim uses an agent-based approach to model individual travel behavior [145]. As a result, MATSim can capture more complex individual decision processes, while SUMO is better suited for overall traffic flow modeling.…”
Section: ) Mobility and Autonomous Vehiclesmentioning
confidence: 99%
“…To choose a vehicle phase to display green with a specific duration 18 [9,[11][12][13]15,21,[26][27][28][29][31][32][33][34]36,37,39,40] To choose the green time for current vehicle phase 4 [10,17,23,41] To determine whether or not to end current vehicle phase 8 [7,16,[18][19][20]24,25,38] To adjust the green time for all vehicle phases in next cycle 5 [8,14,22,30,35] Vehicle-specific performance measure used to construct rewards Number of already served vehicles 14 [12,13,17,18,[20][21][22][23]28,31,33,34,38,39] Wait time of already ...…”
Section: Action Taken By An Agentmentioning
confidence: 99%
“…Boukerche et al [13], to address the problem that existing methods ignore the impact of transmission delay on the system exchanging traffic flow information, proposed a traffic state detection method, and proved to solve the data transmission delay problem by an experimental comparison. Alegre et al [14] proposed the TOS(λ)-FB algorithm and proved its efficiency by combining the Fourier basis function and the reinforcement learning SARSA(λ) algorithm in order to solve the dimensional explosion problem due to the large state space. Wang et al [15] investigated multi-intelligent reinforcement learning for large-scale traffic signal optimization control problems.…”
Section: Introductionmentioning
confidence: 99%