Abstract
Adaptive traffic signal control (ATSC) can ease increasing congestion and relieve pressure on metropolitan transportation systems. In a large-scale road network, ATSC has a high-dimensional action space, which makes training slow and convergence difficult for conventional centralized deep reinforcement learning (DRL) approaches. Multi-agent reinforcement learning (MARL) overcomes this issue by decomposing the joint action space into several sub-spaces, with each agent searching for the optimal action in its own space. However, if all agents make decisions independently and maximize only their own rewards, the state transition probability of the environment in the Markov decision process (MDP) becomes non-stationary, and ATSC will ultimately fail to converge to the optimal policy. To enable agents to learn to cooperate, this paper proposes a novel MARL method in which a difference reward overcomes the credit assignment problem among cooperating agents. Moreover, a spatially weighted reward, which lets agents account for the rewards of their neighbors in the road network, is designed to evaluate the policies of decentralized actor networks, reinforcing cooperation among agents. Comparisons against an independent DRL approach and other multi-agent approaches in a large-scale network demonstrate that the proposed MARL approach outperforms them in terms of average reward and travel delay.
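The spatially weighted reward described above can be illustrated with a minimal sketch: each intersection agent mixes its own reward with those of its neighbors, weighted by proximity in the road network. The function name `spatially_weighted_rewards`, the exponential decay factor `alpha`, and the hop-distance matrix are illustrative assumptions for this sketch, not details taken from the paper.

```python
import numpy as np

def spatially_weighted_rewards(rewards, dist, alpha=0.5):
    """Sketch of a spatially weighted reward (assumed form, not the paper's exact formula).

    rewards : (n,) array of per-agent rewards
    dist    : (n, n) array of hop distances between intersections
    alpha   : assumed decay factor; weight falls off with network distance
    """
    weights = alpha ** dist                        # self gets weight 1 (dist 0)
    weights /= weights.sum(axis=1, keepdims=True)  # normalize per agent
    return weights @ rewards                       # neighbor-aware reward per agent

# Toy example: three intersections on a line (0 - 1 - 2).
dist = np.array([[0, 1, 2],
                 [1, 0, 1],
                 [2, 1, 0]], dtype=float)
r = np.array([1.0, 0.0, -1.0])
print(spatially_weighted_rewards(r, dist))
```

With this construction, an agent whose neighbors do poorly sees a lower reward itself, which is the mechanism by which cooperation is reinforced in the actors' policy evaluation.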