Abstract

The optimal control of traffic lights and vehicle speed are two common ways to improve urban road traffic. Adaptive traffic signal control systems (ATSC) can adjust traffic light signal plans to maximize the intersection throughput globally, while connected autonomous vehicles (CAV) can proactively change their speed to stabilize traffic flow locally. Recent studies apply deep reinforcement learning (DRL) to achieve better control of either ATSC or CAV, respectively, thanks to the rise of big data. However, as it is difficult to train two agent types in an ever-changing environment, the joint optimization of ATSC and CAV still remains traditional transportation methods (e.g., mixed-integer linear programming). We propose a proximal policy optimization (PPO) based DRL model to simultaneously control traffic lights and CAV, relying on the vehicle to infrastructure (V2I) communications. Preliminary results under a 2×3 urban grid map show the effectiveness of our new DRL model in reducing fuel consumption, CO <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</inf> emissions, and travel time.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call