Reinforcement Learning for Branch-and-Bound Optimisation Using Retrospective Trajectories

Christopher W F Parsonson, Alexandre Laterre, Thomas D Barrett

doi:10.1609/aaai.v37i4.25521

Abstract

Combinatorial optimisation problems framed as mixed integer linear programmes (MILPs) are ubiquitous across a range of real-world applications. The canonical branch-and-bound algorithm seeks to exactly solve MILPs by constructing a search tree of increasingly constrained sub-problems. In practice, its solving time depends on heuristics, such as the choice of the next variable to constrain ('branching'). Recently, machine learning (ML) has emerged as a promising paradigm for branching. However, prior works have struggled to apply reinforcement learning (RL), citing sparse rewards, difficult exploration, and partial observability as significant challenges. Instead, leading ML methodologies resort to approximating high-quality handcrafted heuristics with imitation learning (IL), which precludes the discovery of novel policies and requires expensive data labelling. In this work, we propose retro branching: a simple yet effective approach to RL for branching. By retrospectively deconstructing the search tree into multiple paths, each contained within a sub-tree, we enable the agent to learn from shorter trajectories with more predictable next states. In experiments on four combinatorial tasks, our approach enables learning-to-branch without any expert guidance or pre-training. We outperform the current state-of-the-art RL branching algorithm by 3-5x and come within 20% of the best IL method's performance on MILPs with 500 constraints and 1000 variables, with ablations verifying that our retrospectively constructed trajectories are essential to achieving these results.
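The idea of retrospectively deconstructing a finished search tree into multiple paths can be illustrated with a minimal sketch. The abstract does not state which rule is used to pick each path, so the code below assumes a simple greedy rule (always follow the first child, and let the remaining siblings start their own sub-tree paths). The function name `retrospective_paths` and the dictionary-based tree encoding are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
from collections import deque


def retrospective_paths(children, root):
    """Split a finished search tree into node-disjoint paths.

    `children` maps a node id to the list of its child ids (hypothetical
    encoding). Each returned path starts at the root of some sub-tree and
    runs down to a leaf, so it is fully contained within that sub-tree.
    ASSUMPTION: the path always follows the first child; the selection
    rule used in the paper is not given in the abstract.
    """
    paths = []
    pending = deque([root])  # roots of sub-trees not yet covered by a path
    while pending:
        node = pending.popleft()
        path = [node]
        while children.get(node):
            first, *rest = children[node]
            pending.extend(rest)  # siblings become roots of later paths
            path.append(first)
            node = first
        paths.append(path)
    return paths


# A small binary tree: node 0 branches into 1 and 2, and so on.
tree = {0: [1, 2], 1: [3, 4], 2: [5, 6]}
paths = retrospective_paths(tree, 0)
print(paths)  # [[0, 1, 3], [2, 5], [4], [6]]
```

Each path could then be treated as one short training episode in place of a single long episode spanning the whole tree, which is the intuition behind the shorter trajectories described above.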


Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jun 26, 2023
Citations: 3

