Optimised Traffic Light Management Through Reinforcement Learning: Traffic State Agnostic Agent vs. Holistic Agent With Current V2I Traffic State Knowledge

Johannes V S Busch,Vincent Latzko,Frank H P Fitzek,Martin Reisslein

doi:10.1109/ojits.2020.3027518

Johannes V S Busch, Vincent Latzko + Show 2 more

Open Access

https://doi.org/10.1109/ojits.2020.3027518

Copy DOI

Abstract

Traffic light control falls into two main categories: Agnostic systems that do not exploit knowledge of the current traffic state, e.g., the positions and velocities of vehicles approaching intersections, and holistic systems that exploit knowledge of the current traffic state. Emerging fifth generation (5G) wireless networks enable Vehicle-to-Infrastructure (V2I) communication to reliably and quickly collect the current traffic state. However, to the best of our knowledge, the optimized traffic light management without and with current traffic state information has not been compared in detail. This study fills this gap in the literature by designing representative Deep Reinforcement Learning (DRL) agents that learn the control of multiple traffic lights without and with current traffic state information. Our agnostic agent considers mainly the current phase of all traffic lights and the expired times since the last change. In addition, our holistic agent considers the positions and velocities of the vehicles approaching the intersections. We compare the agnostic and holistic agents for simulated traffic scenarios, including a road network from Barcelona, Spain. We find that the holistic system substantially increases average vehicle velocities and flow rates, while reducing CO <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sub> emissions, average wait and trip times, as well as a driver stress metric.

Highlights

This section introduces the MDP that we developed in this study, including states, actions, and rewards, as well as the Deep Reinforcement Learning (DRL) algorithm to learn the control of the traffic environment
This DRL approach showed to be able to effectively learn the intelligent control of traffic light signaling at multiple intersections from interaction with its environment
We compared the performance of an agnostic agent, that cannot communicate with vehicles in the traffic network, with the performance of a holistic agent, that features a V2I communication interface and knows the positions and velocities of all vehicles

Summary

MOTIVATION

E FFECTIVE transportation systems are a key requirement for economic competitiveness and environmental sustainability. Current and upcoming standards, such as IEEE 802.11p, LTE-V, and 5G, allow the exchange of information between individual vehicles and the traffic infrastructure, eventually providing the infrastructure with holistic knowledge of the current state of the traffic system. This should, in theory, enable highly informed control decisions and facilitate congestion mitigation. We find that compared to the agnostic agent, the holistic agent achieves significantly higher average vehicle velocities and flow rates, as well as significantly shorter average trip times through the road networks. For high traffic demands at a single intersection, the holistic agent increases the average vehicle velocities only slightly compared to the agnostic agent.

BACKGROUND

REINFORCEMENT LEARNING

DRL FOR TRAFFIC CONTROL

PERFORMANCE COMPARISON

Findings

DISCUSSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Open Journal of Intelligent Transportation Systems	Publication Date: Jan 1, 2020
Citations: 64	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Optimised Traffic Light Management Through Reinforcement Learning: Traffic State Agnostic Agent vs. Holistic Agent With Current V2I Traffic State Knowledge

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Open Journal of Intelligent Transportation Systems

Lead the way for us

Similar Papers

STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control
Yanan Wang ... Chang Tan
IEEE Transactions on Mobile Computing | VOL. 21
Yanan Wang, et. al.Yanan Wang ... Chang Tan
01 Jun 2022
IEEE Transactions on Mobile Computing | VOL. 21

RANCANG BANGUN SIMULASI PENGENDALI LAMPU LALU LINTAS PADA PERSIMPANGAN DENGAN LIMA JALUR
Rahma Farah Ningrum ...
Jurnal Teknik | VOL. 6
Rahma Farah Ningrum, et. al.Rahma Farah Ningrum ...
01 Jun 2017
Jurnal Teknik | VOL. 6

Joint Control of Lane Allocation and Traffic Light for Changeable-Lane Intersection Based on Reinforcement Learning
Emmanuel S A Gyarteng ... Yin Long
-
Emmanuel S A Gyarteng, et. al.Emmanuel S A Gyarteng ... Yin Long
22 Dec 2021
22 Dec 2021

Design of Portable Intelligent Traffic Light Alarm System for the Blind
Lili Tang
-
Lili TangLili Tang
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimised Traffic Light Management Through Reinforcement Learning: Traffic State Agnostic Agent vs. Holistic Agent With Current V2I Traffic State Knowledge

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Open Journal of Intelligent Transportation Systems