An extended ϵ‐constraint method for a multiobjective finite‐horizon Markov decision process

Maryam Eghbali‐Zarch, Kazem Dehghan‐Sanej, Reza Tavakkoli‐Moghaddam, Amir Azaron

doi:10.1111/itor.12989

Abstract

A Markov decision process (MDP) is an appropriate mathematical framework for analyzing and modeling a large class of sequential decision‐making problems. Real‐world applications often require evaluating a decision against several conflicting objectives. This paper presents an extended ϵ‐constraint method for a multiobjective finite‐horizon MDP. The method integrates the ϵ‐constraint method with the K‐best policies algorithm to find the nondominated deterministic Markovian policies on the Pareto‐optimal frontier. The proposed algorithm is evaluated on biobjective maintenance scheduling and machine running speed selection problems, and its performance is compared with a classic approach from the literature, the weighted‐sum (WS) method. The results show that the proposed algorithm obtains a good‐quality Pareto frontier and has advantages over the WS method.
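
The idea of combining an ϵ‐constraint with a ranking of policies can be illustrated on a toy problem. The sketch below is not the authors' implementation: the two‐state, two‐action, two‐epoch model and all of its numbers are hypothetical, and a brute‐force enumeration of every deterministic Markovian policy stands in for the K‐best policies algorithm. Policies are ranked by the first objective, the best‐ranked policy satisfying the bound on the second objective is accepted, and the bound is then tightened to that policy's second‐objective value. A simple weighted‐sum sweep is included for comparison.

```python
from itertools import product

# Hypothetical toy model (not from the paper): states 0 = good, 1 = worn;
# actions 0 = run, 1 = maintain; two decision epochs; start in state 0.
STATES, ACTIONS, HORIZON, START = (0, 1), (0, 1), 2, 0
# P[s, a] = transition probabilities to states (0, 1)
P = {(0, 0): (0.5, 0.5), (0, 1): (1.0, 0.0),
     (1, 0): (0.0, 1.0), (1, 1): (1.0, 0.0)}
# C[s, a] = (objective 1: cost, objective 2: downtime); both are minimized
C = {(0, 0): (1.0, 0.0), (0, 1): (0.5, 1.0),
     (1, 0): (6.0, 0.0), (1, 1): (2.0, 2.0)}

def evaluate(policy):
    """Backward induction: expected total (cost, downtime) from START."""
    value = {s: (0.0, 0.0) for s in STATES}  # terminal values
    for rule in reversed(policy):            # rule[s] = action in state s
        value = {
            s: tuple(
                C[s, rule[s]][k]
                + sum(p * value[t][k] for t, p in zip(STATES, P[s, rule[s]]))
                for k in range(2)
            )
            for s in STATES
        }
    return value[START]

# Enumerate every deterministic Markovian policy (one rule per epoch).
rules = list(product(ACTIONS, repeat=len(STATES)))
policies = list(product(rules, repeat=HORIZON))
# Keep one representative policy per objective vector.
by_value = {}
for pi in policies:
    by_value.setdefault(evaluate(pi), pi)
# Stand-in for K-best: full ranking by objective 1, ties broken by objective 2.
ranked = sorted(by_value)

def epsilon_constraint(ranked):
    """Walk the ranking, tightening the bound on objective 2 after each hit."""
    frontier, eps = [], float("inf")
    while True:
        feasible = [f for f in ranked if f[1] < eps]
        if not feasible:
            return frontier
        best = feasible[0]       # best objective 1 subject to objective 2 < eps
        frontier.append(best)
        eps = best[1]            # next point must strictly improve objective 2

def weighted_sum(ranked, steps=10):
    """Minimize w*f1 + (1-w)*f2 for w on a uniform grid over [0, 1]."""
    found = set()
    for i in range(steps + 1):
        w = i / steps
        found.add(min(ranked, key=lambda f: w * f[0] + (1 - w) * f[1]))
    return sorted(found)

frontier = epsilon_constraint(ranked)
supported = weighted_sum(ranked)
print("epsilon-constraint:", frontier)
print("weighted sum:      ", supported)
```

In this toy instance the ϵ‐constraint walk returns four nondominated points, while the weighted‐sum sweep returns only three. The point (4.25, 0.5) lies in a nonconvex part of the frontier, which no choice of weights can reach. This is a general limitation of weighted sums and is shown here only as an illustration, not as the specific result reported in the paper.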

Journal: International Transactions in Operational Research
Publication Date: May 4, 2021
Citations: 2

Similar Papers

Pareto ant colony optimization based algorithm to solve maintenance and production scheduling problem in parallel machine case
A Berrichi ... F Yalaoui
01 Jul 2009

Application of Markov Decision Process in Generating Units Maintenance Scheduling
A Rajabi-Ghahnavie ... M Fotuhi-Firuzabad
01 Jun 2006

Contraction Mappings in the Theory Underlying Dynamic Programming
Eric V Denardo
SIAM Review | VOL. 9
01 Apr 1967

Replacement policy for a single-component machine with limited spares in a finite time horizon
Y Wang ... Y Li
01 Jan 2021
