Deep Reinforcement Learning Approach to Solve Dynamic Vehicle Routing Problem with Stochastic Customers

Waldy Joe,Hoong Chuin Lau

doi:10.1609/icaps.v30i1.6685

Abstract

In real-world urban logistics operations, changes to the routes and tasks occur in response to dynamic events. To ensure customers' demands are met, planners need to make these changes quickly (sometimes instantaneously). This paper proposes the formulation of a dynamic vehicle routing problem with time windows and both known and stochastic customers as a route-based Markov Decision Process. We propose a solution approach that combines Deep Reinforcement Learning (specifically neural networks-based Temporal-Difference learning with experience replay) to approximate the value function and a routing heuristic based on Simulated Annealing, called DRLSA. Our approach enables optimized re-routing decision to be generated almost instantaneously. Furthermore, to exploit the structure of this problem, we propose a state representation based on the total cost of the remaining routes of the vehicles. We show that the cost of the remaining routes of vehicles can serve as proxy to the sequence of the routes and time window requirements. DRLSA is evaluated against the commonly used Approximate Value Iteration (AVI) and Multiple Scenario Approach (MSA). Our experiment results show that DRLSA can achieve on average, 10% improvement over myopic, outperforming AVI and MSA even with small training episodes on problems with degree of dynamism above 0.5.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the International Conference on Automated Planning and Scheduling	Publication Date: Jun 1, 2020
Citations: 36	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Deep Reinforcement Learning Approach to Solve Dynamic Vehicle Routing Problem with Stochastic Customers

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling

Lead the way for us

Similar Papers

Scenario-Based Planning for Partially Dynamic Vehicle Routing with Stochastic Customers
Russell W Bent ... Pascal Van Hentenryck
Operations Research | VOL. 52
Russell W Bent, et. al.Russell W Bent ... Pascal Van Hentenryck
01 Dec 2004
Operations Research | VOL. 52

A branch‐and‐regret heuristic for stochastic and dynamic vehicle routing problems
Lars Magnus Hvattum ... Gilbert Laporte
Networks | VOL. 49
Lars Magnus Hvattum, et. al.Lars Magnus Hvattum ... Gilbert Laporte
27 Mar 2007
Networks | VOL. 49

Improved ant colony optimisation for the dynamic multi-depot vehicle routing problem
Bin Yu ... Baozhen Yao
International Journal of Logistics Research and Applications | VOL. 16
Bin Yu, et. al.Bin Yu ... Baozhen Yao
01 Apr 2013
International Journal of Logistics Research and Applications | VOL. 16

Optimizing a Dynamic Vehicle Routing Problem with Deep Reinforcement Learning: Analyzing State-Space Components
Anna Konovalenko ... Lars Magnus Hvattum
Logistics | VOL. 8
Anna Konovalenko, et. al.Anna Konovalenko ... Lars Magnus Hvattum
02 Oct 2024
Logistics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Reinforcement Learning Approach to Solve Dynamic Vehicle Routing Problem with Stochastic Customers

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling