Abstract

This study leverages simulation-optimisation with a Reinforcement Learning (RL) model to analyse the routing behaviour of delivery vehicles (DVs). We conceptualise the system as a stochastic k-armed bandit problem, representing a sequential interaction between a learner (the DV) and its surrounding environment. Each DV is assigned a random number of customers and an initial delivery route. If a loading zone is unavailable, the RL model selects an alternative delivery strategy, and the route is modified accordingly. The penalty is gauged by the additional trucking and walking time incurred compared to the originally planned route. Our methodology is tested on a simulated network featuring realistic traffic conditions and a fleet of DVs employing four distinct last-mile delivery strategies. The results of our numerical experiments underscore the advantages of providing DVs with an RL-based decision support system for en-route decision-making, yielding benefits to the overall efficiency of the transport network.

Highlights

- Combining simulation and optimisation algorithms with reinforcement learning
- Modelling DVs' en-route parking decisions with a k-armed bandit algorithm
- Evaluating the impacts of delivery strategies on traffic congestion and on last-mile delivery efficiency
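The k-armed bandit framing described above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes an epsilon-greedy selection rule and sample-average value updates, with the four delivery strategies as arms and the reward defined as the negative of the extra trucking-plus-walking time (the paper's penalty). The penalty distributions below are invented purely for demonstration.

```python
import random

def choose_strategy(q_values, epsilon=0.1):
    """Epsilon-greedy selection over estimated strategy values."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))          # explore
    return max(range(len(q_values)), key=lambda a: q_values[a])  # exploit

def update(q_values, counts, action, penalty):
    """Incremental sample-average update; a lower penalty is a higher reward."""
    counts[action] += 1
    reward = -penalty  # penalty = extra trucking + walking time (minutes)
    q_values[action] += (reward - q_values[action]) / counts[action]

# Four delivery strategies, mirroring the paper's setup.
q = [0.0] * 4
n = [0] * 4
random.seed(0)
# Hypothetical mean penalties per strategy, for illustration only.
true_penalty = [12.0, 5.0, 9.0, 7.0]
for _ in range(2000):
    a = choose_strategy(q, epsilon=0.1)
    observed = random.gauss(true_penalty[a], 2.0)  # stochastic environment
    update(q, n, a, observed)
best = max(range(4), key=lambda a: q[a])
```

After enough interactions, the learner's value estimates rank the strategies by expected penalty, so the DV's en-route choice converges on the strategy with the lowest expected extra time while still occasionally exploring the alternatives.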
