Abstract

The Dynamic Traffic Assignment (DTA) is one of the important measures to alleviate urban network traffic congestion. The congestions are usually caused by stochastic traffic demands, which are generally unassignable from time dimension in the real-world but are assumed to be assignable in existing DTA methods (i.e. real-time travel demands). In this paper, a distributed DTA method for preventing urban network traffic congestion caused by stochastic real-time travel demands by improving Multi-Agent Reinforcement Learning (MARL). A team structure, which consists of decision-makers and advisers, is designed to learn parallelly in realistic DTA tasks. To reduce the size of the solution space adaptively, the dynamic critical values advised by adviser agents are adopted as constraints for the strategy space of decision-makers (i.e. main agents). A collaborative heterogeneous-adviser mechanism is designed to avoid deviation of guidance. To enhance the adaptability of DTA to the changeable external environment, the mixed strategy concept is introduced to improve the decision-making process of main agents. The respective mapping mechanisms are designed to define adaptive learning rates to improve the sensitivity of MARL. The Sioux Falls (SF) network is established as a test platform via a Dynamic Network Loading (DNL). The effectiveness of the suggested DTA method is assessed through numerical simulations SF network. Under the influence of the scenario with stochastic real-time travel demands, the results show that the proposed method outperforms in terms of the throughput of the network and the individual average travel time among the overall network. Additionally, the ability of the proposed method in response to the external environment rapidly has also been demonstrated. Adopting the suggested method can improve the state of the art to assign stochastic real-time travel demands dynamically and to avoid potential traffic congestion fundamentally.

Highlights

  • Traffic congestion of urban networks and its derivative effects are the problems faced by many cities [1]

  • The dynamic traffic demand, which is unassignable on the time scale, is the general state of the modern urban network

  • For Dynamic Traffic Assignment (DTA) with this state, a decentralized method based on Multi-Agent Reinforcement Learning (MARL) has been presented in this article

Read more

Summary

INTRODUCTION

Traffic congestion of urban networks and its derivative effects are the problems faced by many cities [1]. Each agent provides recommendations on the space of action to the corresponding main agent and updates its experience from the external environment Agents increase their adaptive capacity for DTA by employing different MARL algorithms. A. HETEROGENEOUS-ADVISER BASED MULTI-AGENT REINFORCEMENT LEARNING (HAB-MARL) MARL (described in Section III.C) theory is improved to design a distributed assignment method for DTA. Considering the application in DTA and role in HAB-MARL, the learning rate of adviser agents needs to be adaptive to dynamic goals. Definition 14: The learning rate of main agent i, αi is a function of the variance of individual travel costs ALGORITHM OF HAB-MARL According to the definitions in Section IV.A, it summarizes HAB-MARL algorithm performed by each agent in Algorithm 1

NUMERICAL SIMULATION EXPERIMENT
2) CONTRAST METHODS
Findings
CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.