Abstract
Sequential decision-making under multiple objective functions involves two problems: exhaustively searching for Pareto-optimal policies, and selecting a policy from the resulting set of Pareto-optimal policies according to the decision maker's preferences. This paper focuses on the latter problem. Selecting a policy that reflects the decision maker's preferences requires ordering the Pareto-optimal policies, which is difficult because those preferences are generally tacit knowledge and hard to quantify. For this reason, conventional methods have mainly elicited preferences through dialogue with decision makers or through pairwise comparisons. In contrast, this paper proposes a method based on inverse reinforcement learning that estimates the weight of each objective from a decision-making sequence. The estimated weights can then be used to evaluate the Pareto-optimal policies quantitatively from the viewpoint of the decision maker's preferences. We applied the proposed method to a multi-objective reinforcement learning benchmark problem and verified its effectiveness as a method for eliciting the weight of each objective function.
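The abstract does not give the concrete IRL formulation, but the core idea it describes — recovering scalarization weights from a demonstrated decision sequence and using them to rank Pareto-optimal policies — can be sketched as follows. This is a minimal illustration under assumed conventions: `estimate_weights`, `front`, and `demo` are hypothetical names, the perceptron-style max-margin update is a generic stand-in for the paper's (unspecified) IRL procedure, and the return vectors are illustrative, not taken from the paper's benchmark.

```python
import numpy as np

def estimate_weights(demo_return, pareto_returns, lr=0.1, iters=500):
    """Estimate scalarization weights w on the probability simplex such that
    the demonstrated per-objective return vector scores highest under w.
    Perceptron-style max-margin update; a generic stand-in for the paper's
    IRL procedure, which the abstract does not specify."""
    k = demo_return.shape[0]
    w = np.full(k, 1.0 / k)                 # start from uniform weights
    for _ in range(iters):
        scores = pareto_returns @ w         # scalarized value of each Pareto-optimal policy
        best = int(np.argmax(scores))
        # Shift weight toward objectives on which the demonstration
        # outperforms the currently best-scoring Pareto policy.
        w += lr * (demo_return - pareto_returns[best])
        w = np.clip(w, 0.0, None)           # keep weights non-negative
        w /= w.sum()                        # renormalize onto the simplex
    return w

# Illustrative two-objective Pareto front (e.g., reward vs. negative cost).
front = np.array([[1.0, -1.0],
                  [5.0, -3.0],
                  [8.0, -13.0]])
demo = front[1]                             # the demonstrator chose the middle trade-off
w_hat = estimate_weights(demo, front)
ranking = np.argsort(-(front @ w_hat))      # policies ordered by estimated preference
print(w_hat, ranking)                       # ranking[0] == 1: the demonstrated policy ranks first
```

One caveat of any such linear scalarization, independent of the estimation procedure: only policies on the convex hull of the Pareto front can be ranked first under some weight vector, so policies in concave regions of the front are never selected this way.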