Multiagent Soft Actor–Critic for Traffic Light Timing

Lan Wu,Cong Qiao,Yuanming Wu,Yafang Tian

doi:10.1061/jtepbs.0000774

Abstract

Deep reinforcement learning has strong perception and decision-making capabilities that can effectively solve the problem of continuous high-dimensional state-action space and has become the mainstream method in the field of traffic light timing. However, due to model structural defects or different strategic mechanisms of models, most deep reinforcement learning models have problems such as convergence and divergence or poor exploration capabilities. Therefore, this paper proposes a multi-agent Soft Actor–Critic (SAC) for traffic light timing. Multi-agent SAC adds an entropy item to measure the randomness of the strategy in the objective function of traditional reinforcement learning and maximizes the sum of expected reward and entropy item to improve the model’s exploration ability. The system model can learn multiple optimal timing schemes, avoid repeated selection of the same optimal timing scheme and fall into a local optimum or fail to converge. Meanwhile, it abandons low reward value strategies to reduce data storage and sampling complexity, accelerate training, and improve the stability of the system. Comparative experiments show that the method based on multi-agent SAC traffic light timing can solve the existing problems of deep reinforcement learning and improve the efficiency of vehicles passing through in different traffic scenarios.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multiagent Soft Actor–Critic for Traffic Light Timing

Abstract

Talk to us

Similar Papers

More From: Journal of Transportation Engineering, Part A: Systems

Lead the way for us

Similar Papers

Deep reinforcement learning based collision avoidance system for autonomous ships
Yong Wang ... Zhen Yang
Ocean Engineering | VOL. 292
Yong Wang, et. al.Yong Wang ... Zhen Yang
12 Dec 2023
Ocean Engineering | VOL. 292

Deep Reinforcement Learning for Automatic Drilling Optimization Using an Integrated Reward Function
Xu Huang ... Ted Furlong
-
Xu Huang, et. al.Xu Huang ... Ted Furlong
27 Feb 2024
27 Feb 2024

Explainable AI in Deep Reinforcement Learning Models: A SHAP Method Applied in Power System Emergency Control
Ke Zhang ... Peidong Xu
-
Ke Zhang, et. al.Ke Zhang ... Peidong Xu
30 Oct 2020
30 Oct 2020

Intelligent Fault Quantitative Identification for Industrial Internet of Things (IIoT) via a Novel Deep Dual Reinforcement Learning Model Accompanied With Insufficient Samples
Yuanhong Chang ... Shuilong He
IEEE Internet of Things Journal | VOL. 9
Yuanhong Chang, et. al.Yuanhong Chang ... Shuilong He
15 Oct 2022
IEEE Internet of Things Journal | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiagent Soft Actor–Critic for Traffic Light Timing

Abstract

Talk to us

Similar Papers

More From: Journal of Transportation Engineering, Part A: Systems