2022
DOI: 10.1016/j.trc.2022.103560
|View full text |Cite
|
Sign up to set email alerts
|

Multi-agent reinforcement learning for Markov routing games: A new modeling paradigm for dynamic traffic assignment

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 36 publications
(9 citation statements)
references
References 38 publications
0
9
0
Order By: Relevance
“…States capture situations where the formalized system is in. In contrast to previous studies that separate observations of the decision-maker from the states [24,28,4], we define the states as fully observable information of the system for the decision-maker. When the decision-maker takes an action and gets a reward, the system transits to the next state depending on transition probabilities.…”
Section: Markov Decision Processmentioning
confidence: 99%
“…States capture situations where the formalized system is in. In contrast to previous studies that separate observations of the decision-maker from the states [24,28,4], we define the states as fully observable information of the system for the decision-maker. When the decision-maker takes an action and gets a reward, the system transits to the next state depending on transition probabilities.…”
Section: Markov Decision Processmentioning
confidence: 99%
“…Shou et al [ 55 ] considered the approach of the Markov routing game to model the interaction between adaptive agents and traffic congestion. The proposed approach allows the agent to perform repeated interactions with other agents and the traffic environment to learn optimal policies even when they have limited or incomplete information about the environment.…”
Section: Marl For Cavsmentioning
confidence: 99%
“…The number of trips decreases with increasing distance from a city center. The applied Gawron algorithm [24]- [26] allows achieving close to real-world behavior of driver routing. The output of the SUMO simulation is the travel time on each edge during the peak congestion hour.…”
Section: A Experimental Setupmentioning
confidence: 99%