Abstract
Idle vehicle relocation is crucial for addressing the demand-supply imbalances that frequently arise in ride-hailing systems. The two mainstream methodologies, optimization and reinforcement learning, suffer from significant computational drawbacks. Optimization models must be solved in real time and often trade off model fidelity (and hence solution quality) for computational efficiency. Reinforcement learning is expensive to train and often struggles to achieve coordination across a large fleet. This paper designs a hybrid approach that leverages the strengths of both while overcoming their drawbacks. Specifically, it trains an optimization proxy, i.e., a machine-learning model that approximates an optimization model, and then refines the proxy with reinforcement learning. This Reinforcement Learning from Optimization Proxy (RLOP) approach is computationally efficient to train and deploy, and achieves better results than reinforcement learning or optimization alone. Numerical experiments on the New York City dataset show that RLOP reduces both relocation costs and computation time significantly compared to the optimization model, while pure reinforcement learning fails to converge due to computational complexity.
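To make the two-stage idea concrete, below is a minimal sketch of the training pipeline the abstract describes: first fit a proxy network to solutions produced by an optimization model, then fine-tune it with a policy-gradient update. Everything in the sketch (the zone count, the toy stand-in solver, the reward, and all function names) is an illustrative assumption, not the paper's actual formulation.

```python
# Sketch of the two-stage RLOP idea: imitate an optimization model,
# then refine with reinforcement learning. All names, dimensions, and
# the toy cost/reward functions below are illustrative assumptions.
import torch
import torch.nn as nn

N_ZONES = 10  # hypothetical number of city zones

# Proxy: maps a demand-supply state to a relocation distribution over zones.
proxy = nn.Sequential(
    nn.Linear(2 * N_ZONES, 64), nn.ReLU(),
    nn.Linear(64, N_ZONES),
)
opt = torch.optim.Adam(proxy.parameters(), lr=1e-3)

def toy_optimizer_solution(state):
    # Stand-in for the optimization model: steer idle vehicles toward
    # the zones with the largest demand-supply gap (softmax-normalized).
    demand, supply = state[..., :N_ZONES], state[..., N_ZONES:]
    return torch.softmax(demand - supply, dim=-1)

# Stage 1: supervised training of the proxy on optimization solutions.
for _ in range(500):
    state = torch.rand(64, 2 * N_ZONES)        # synthetic states
    target = toy_optimizer_solution(state)     # "labels" from the solver
    loss = nn.functional.cross_entropy(proxy(state), target)
    opt.zero_grad()
    loss.backward()
    opt.step()

def reward(state, action):
    # Toy reward: negative remaining shortfall after relocating
    # one vehicle to the chosen zone.
    demand = state[:N_ZONES].clone()
    supply = state[N_ZONES:].clone()
    supply[action] += 1.0
    return -(demand - supply).clamp(min=0).sum()

# Stage 2: refine the pretrained proxy with REINFORCE.
for _ in range(200):
    state = torch.rand(2 * N_ZONES)
    dist = torch.distributions.Categorical(logits=proxy(state))
    action = dist.sample()
    loss = -dist.log_prob(action) * reward(state, action)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

The pretraining stage gives the policy a strong, coordinated starting point cheaply, which is why the RL refinement can converge where training from scratch does not.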