Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

Fahong Yu,Meijia Chen,Xiaoyun Xia,Kuibiao Deng,Qiang Peng,Dongping Zhu

doi:10.4018/ijitsa.342084

Abstract

Multi-depot vehicle routing problem with time windows (MDVRPTW) is a valuable practical issue in urban logistics. However, heuristic methods may fail to generate high-quality solutions for massive problems instantly. Thus, this article presents a novel reinforcement learning algorithm integrated with a multi-head attention mechanism and a local search strategy to solve the problem efficiently. The routing optimization was regarded as a vehicle tour generation process and an encoder-decoder was used to generate routes for vehicles departing from different depots iteratively. A multi-head attention strategy was employed for mining complex spatiotemporal correlations within time windows in the encoder. Then, a decoder with multi-agent was designed to generate solutions by optimizing reward and observing transition state. Meanwhile, a local search strategy was employed to improve the quality of solutions. The experiments results demonstrate that the proposed method can significantly outperform traditional methods in effectiveness and robustness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technologies and Systems Approach

Lead the way for us

Journal: International Journal of Information Technologies and Systems Approach	Publication Date: Apr 9, 2024
License type: CC BY 3.0

Similar Papers

A bio-inspired approach: Firefly algorithm for Multi-Depot Vehicle Routing Problem with Time Windows
R Yesodha ... T Amudha
Computer Communications | VOL. 190
R Yesodha, et. al.R Yesodha ... T Amudha
09 Apr 2022
Computer Communications | VOL. 190

Multi-Depot Vehicle Routing Problem with Time Windows and Multi-Type Vehicle Number Limits and its Genetic Algorithm
Xuping Wang ... Chuanlei Xu
-
Xuping Wang, et. al.Xuping Wang ... Chuanlei Xu
01 Oct 2008
01 Oct 2008

Metaheuristic for solving routing problem in logistics management
M Rajmohan ... P Shahabudeen
International Journal of Operational Research | VOL. 6
M Rajmohan, et. al.M Rajmohan ... P Shahabudeen
01 Jan 2009
International Journal of Operational Research | VOL. 6

A Survey for Vehicle Routing Problems and Its Derivatives
Ming Han ... Yabin Wang
IOP Conference Series: Materials Science and Engineering | VOL. 452
Ming Han, et. al.Ming Han ... Yabin Wang
01 Dec 2018
IOP Conference Series: Materials Science and Engineering | VOL. 452

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technologies and Systems Approach