Solving biobjective traveling thief problems with multiobjective reinforcement learning

Gemilang Santiyuda, Retantyo Wardoyo, Reza Pulungan

doi:10.1016/j.asoc.2024.111751

Abstract

This study proposes an end-to-end multiobjective reinforcement learning (MORL) approach to solving biobjective traveling thief problems (TTPs). In a TTP, a thief visits cities and selects items to maximize profit while minimizing travel time, subject to a knapsack’s capacity. The study evaluates combinations of two architectures, namely the pointer network (PN) and the attention mechanism (AM), with three MORL methods: deep reinforcement learning multiobjective algorithm (DRLMOA), multi-sample Pareto hypernetwork (PHN), and manifold-based policy search (MBPS). However, PN and AM cannot be used directly to predict two different sequences simultaneously: the city tour and the item selection. Therefore, a solution encoding and decoding scheme is proposed so that TTPs can be solved without substantially modifying PN and AM. The methods are trained only on small, randomly generated problem instances based on Eil76 instances, and their performance is evaluated on a variety of problem instances. The state-of-the-art non-dominated sorting-based customized random-key genetic algorithm (NDS-BRKGA) serves as the baseline. The experiments show that the proposed methods are competitive with the baseline, particularly on instances with a large number of items. The proposed methods, especially PN-DRLMOA and AM-DRLMOA, also show promising generalization to different and larger graphs. Lastly, the proposed MORL methods generate solutions significantly faster than NDS-BRKGA.
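
To make the two objectives concrete, the following is a minimal sketch of how a single TTP solution might be scored. It assumes the speed model commonly used in the TTP literature, where the thief slows down linearly from `v_max` to `v_min` as the knapsack fills; the abstract itself does not state this model. The function name `evaluate_ttp`, its parameters, and the default speed bounds are hypothetical and chosen only for illustration.

```python
from math import hypot


def evaluate_ttp(coords, tour, picked, items, capacity, v_min=0.1, v_max=1.0):
    """Return (total_profit, travel_time) for one TTP solution.

    coords   : list of (x, y) city coordinates
    tour     : list of city indices, starting at the thief's home city
    picked   : set of indices into `items` that the thief collects
    items    : list of (city, profit, weight) tuples
    capacity : knapsack capacity (assumed > 0)

    Assumption: speed drops linearly with the current load, a model
    commonly used for the TTP but not taken from the abstract.
    """
    total_weight = sum(items[i][2] for i in picked)
    if total_weight > capacity:
        raise ValueError("knapsack capacity exceeded")

    # Aggregate the weight picked up at each city and the total profit.
    weight_at = {}
    profit = 0.0
    for i in picked:
        city, item_profit, item_weight = items[i]
        weight_at[city] = weight_at.get(city, 0.0) + item_weight
        profit += item_profit

    # Walk the closed tour; items are loaded before leaving each city.
    load = 0.0
    travel_time = 0.0
    n = len(tour)
    for k in range(n):
        a, b = tour[k], tour[(k + 1) % n]
        load += weight_at.get(a, 0.0)
        speed = v_max - (v_max - v_min) * load / capacity
        dist = hypot(coords[a][0] - coords[b][0], coords[a][1] - coords[b][1])
        travel_time += dist / speed
    return profit, travel_time
```

Under this sketch, picking an item raises the profit but also lengthens every later leg of the tour, which is why the two objectives conflict and a set of trade-off solutions is sought rather than a single optimum.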

Similar Papers

An Integrated Generation-Compensation Optimization Strategy for Enhanced Short-Term Voltage Security of Large-Scale Power Systems Using Multi-Objective Reinforcement Learning Method
Zhuoming Deng ... Mingbo Liu
Control theory & applications | 01 Nov 2018

Multi-objective safe reinforcement learning
Naoto Horie ... Nobuhiro Inuzuka
Artificial Life and Robotics | 18 Jan 2019

Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning
Man-Je Kim ... Chang Wook Ahn
Electronics | VOL. 11 | 28 Mar 2022

Model-Based Multi-Objective Reinforcement Learning by a Reward Occurrence Probability Vector
Tomohiro Yamaguchi ... Shota Nagahama
01 Jan 2020
