Abstract

Traffic signal control (TSC) plays an important role in alleviating heavy traffic congestion problem. It is helpful to provide an effective transportation system by optimizing traffic signals intelligently. Recently, many deep reinforcement learning methods are proposed to solve TSC. However, most of these methods are trained and tested in a fixed roadnet with the same traffic flow environment. They can not adapt to new environments. Some meta-reinforcement learning methods are proposed to solve this problem, but they can not properly decide when to coordinate traffic signals. This paper proposes a multi-agent meta-reinforcement learning method with coordination and reward shaping to solve TSC. The proposed method combines independent learning and neighbor-aware learning to adapt to different TSC environments. Besides, the proposed method constructs a novel reward shaping. The reward shaping can enhance traffic efficiency by encouraging adjacent intersections to generate more green waves. Based on green waves, vehicles can go straight through multiple intersections without stopping. Experimental results demonstrate that the proposed method achieves the state-of-the-art generalization performance on synthetic and real-world datasets.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.