2012
DOI: 10.1155/2012/208456
|View full text |Cite
|
Sign up to set email alerts
|

Reinforcement Learning Ramp Metering without Complete Information

Abstract: This paper develops a model of reinforcement learning ramp metering (RLRM) without complete information, which is applied to alleviate traffic congestions on ramps. RLRM consists of prediction tools depending on traffic flow simulation and optimal choice model based on reinforcement learning theories. Moreover, it is also a dynamic process with abilities of automaticity, memory and performance feedback. Numerical cases are given in this study to demonstrate RLRM such as calculating outflow rate, density, avera… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 15 publications
0
3
0
Order By: Relevance
“…Several intelligent methods have been proposed for ramp metering including reinforcement learning (RL) [41] and artificial neural networks (ANN) [42]. Research presented in [18,40] and [43][44][45][46][47][48][49][50][51][52][53][54] describe some of the models that use reinforcement learning. Most of the reinforcement learning models use Q-learning [49][50][51] or some methods based on Q-learning.…”
Section: Ramp Metering Algorithmsmentioning
confidence: 99%
“…Several intelligent methods have been proposed for ramp metering including reinforcement learning (RL) [41] and artificial neural networks (ANN) [42]. Research presented in [18,40] and [43][44][45][46][47][48][49][50][51][52][53][54] describe some of the models that use reinforcement learning. Most of the reinforcement learning models use Q-learning [49][50][51] or some methods based on Q-learning.…”
Section: Ramp Metering Algorithmsmentioning
confidence: 99%
“…Peng et al [16] presented a new macro model of traffic flow considering the anticipation optimal velocity. Besides, some microscopic models of driver-vehicle behavior have also been proposed, which include the early linear models proposed by Chandler et al [17], the early nonlinear models presented by Pipes [18] and many recent remarkable works of Wang et al [19], Zhou et al [20], Tang et al [21], Hu et al [22], and so on. They treat each individual vehicle as a particle and regard traffic as a complex system of interacting particles.…”
Section: Introductionmentioning
confidence: 99%
“…The basic RL algorithm named -learning was adopted by this system to alleviate traffic congestion caused by incidents. After this work, several -learning systems considering both local (e.g., [23,24]) and coordinated (e.g., [25,26]) control problems were proposed. However, -learning can only learn from real interactions with the traffic operation and cannot make full use of historical data (or models).…”
Section: Introductionmentioning
confidence: 99%