Reinforcement Learning Agent under Partial Observability for Traffic Light Control in Presence of Gridlocks

Horsuwan, Thanapapas; Aswakul, Chaodit

doi:10.29007/bdgn

Cited by 3 publications

(2 citation statements)

References 6 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The length of the collected traffic jam represents the traffic condition of every action step. Based on the study [9], we define the action step as an interval of 10 seconds. The observed state as S t = {o 1,t , .…”

Section: B State Formulationmentioning

confidence: 99%

Application of Traffic Light Control in Oversaturated Urban Network Using Multi-Agent Deep Reinforcement Learning

Ei Mon,

Ochiai,

Aswakul

2024

IEEE Access

View full text Add to dashboard Cite

Adaptive traffic signal control techniques have been developed in numerous studies to increase traffic flow efficiency. Using traffic signals to design an adaptive traffic management system is ideal for reducing traffic congestion. Reinforcement learning is a branch of current approaches that try to learn a policy function through a trial-and-error process and maximize the reward through properly adjusted interaction with the learning agent's environment. We propose a traffic signal control architecture for an oversaturated urban network using Deep Q-Network. We have enhanced the learning process by incorporating diverse state information through upstream and downstream detailed traffic states. We conduct experiments on the Simulation of Urban MObility, an open-source traffic simulator that supports large-scale traffic signal control.INDEX TERMS multi-agent, oversaturated traffic, reinforcement learning, simulation of urban mobility, traffic signal control.

show abstract

Section: B State Formulationmentioning

confidence: 99%

Application of Traffic Light Control in Oversaturated Urban Network Using Multi-Agent Deep Reinforcement Learning

Ei Mon,

Ochiai,

Aswakul

2024

IEEE Access

View full text Add to dashboard Cite

show abstract

“…In ( Zhang et al, 2018 ) state observability was analyzed in a vehicle-to-infrastructure (V2I) scenario, where the traffic signal agent detects approaching vehicles with Dedicated Short Range Communications (DSRC) technology under different rates. In ( Horsuwan & Aswakul, 2019 ) a scenario with partially observable state (only occupancy sensors available) was studied, however no comparisons with different state definitions or sensors were made. In ( Chu et al, 2019 ), Chu et al introduced Multiagent A2C in scenarios where different vehicle flows distributed in the network changed their insertion rates independently.…”

Section: Related Workmentioning

confidence: 99%

Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control

Alegre

Bazzan

Silva

2021

PeerJ Computer Science

View full text Add to dashboard Cite

In reinforcement learning (RL), dealing with non-stationarity is a challenging issue. However, some domains such as traffic optimization are inherently non-stationary. Causes for and effects of this are manifold. In particular, when dealing with traffic signal controls, addressing non-stationarity is key since traffic conditions change over time and as a function of traffic control decisions taken in other parts of a network. In this paper we analyze the effects that different sources of non-stationarity have in a network of traffic signals, in which each signal is modeled as a learning agent. More precisely, we study both the effects of changing the context in which an agent learns (e.g., a change in flow rates experienced by it), as well as the effects of reducing agent observability of the true environment state. Partial observability may cause distinct states (in which distinct actions are optimal) to be seen as the same by the traffic signal agents. This, in turn, may lead to sub-optimal performance. We show that the lack of suitable sensors to provide a representative observation of the real state seems to affect the performance more drastically than the changes to the underlying traffic patterns.

show abstract

Toward A Digital Twin IoT for the Validation of AI Algorithms in Smart-City Applications

Ngadi,

Bounceur,

Bezoui

et al. 2024

Lecture Notes in Computer Science

View full text Add to dashboard Cite

The development of digital twins for road traffic has garnered significant attention within the scientific community, particularly in the realms of virtualization and the Internet of Things (IoT). The implementation of a digital twin for automobiles offers a virtual replica, capable of discerning the precise location, status, and real-time behavior of each vehicle present in the road traffic network. The primary objective of this endeavor is to create an advanced digital twin of cars that can seamlessly navigate through road traffic. To accomplish this, a meticulously selected technical approach involves employing a platform that simulates virtual sensor networks, accompanied by a purpose-built application that facilitates the access and dissemination of car-related data. Furthermore, this undertaking incorporates the utilization of an existing traffic simulator alongside a robust communication protocol to ensure seamless data transfer between the simulation environment and the sensors responsible for data collection from automobiles.

show abstract

Reinforcement Learning Agent under Partial Observability for Traffic Light Control in Presence of Gridlocks

Cited by 3 publications

References 6 publications

Application of Traffic Light Control in Oversaturated Urban Network Using Multi-Agent Deep Reinforcement Learning

Application of Traffic Light Control in Oversaturated Urban Network Using Multi-Agent Deep Reinforcement Learning

Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control

Toward A Digital Twin IoT for the Validation of AI Algorithms in Smart-City Applications

Contact Info

Product

Resources

About