A linear programming based approach for composite-action Markov decision processes

Zhicong Zhang, Liangwei Zhang, Xiaohui Yan, Shuai Li

doi:10.1051/ro/2018081

Abstract

We study a time-homogeneous, discrete composite-action Markov decision process (CMDP), in which multiple decisions must be made at each state. In this type of Markov decision process, the state variables are divided into two separable sets, and a two-dimensional composite action is chosen at each decision epoch. To solve a CMDP, we propose a novel linear programming model, the Contracted Linear Programming Model (CLPM). We show that the CLPM obtains the optimal state values of a CMDP. We analyze and compare the numbers of variables and constraints in the CLPM and in the Traditional Linear Programming Model (TLPM), and computational experiments compare the running times and memory usage of the two models. Both the theoretical analysis and the computational experiments show that the CLPM outperforms the TLPM in time and space complexity.
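
For orientation, the textbook linear program for a discounted MDP can be written with a two-dimensional action as sketched below. This is only an illustration under stated assumptions, not the paper's TLPM or CLPM: the abstract does not state the optimality criterion, so the discount factor \gamma is assumed, and the symbols S (states), A_1 and A_2 (the two action components), r (reward), and p (transition probability) are notation introduced here.

```latex
% Assumed notation (not from the abstract): S, A_1, A_2, r, p, \gamma \in [0, 1).
% Standard LP whose optimal solution gives the optimal state values v(s)
% of a discounted, reward-maximizing MDP.
\begin{align*}
\min_{v} \quad & \sum_{s \in S} v(s) \\
\text{s.t.} \quad & v(s) \ge r(s, a_1, a_2)
    + \gamma \sum_{s' \in S} p(s' \mid s, a_1, a_2)\, v(s'),
    \qquad \forall\, s \in S,\ a_1 \in A_1,\ a_2 \in A_2 .
\end{align*}
```

In this generic form there are |S| variables and |S| |A_1| |A_2| constraints, one per state and composite action, which gives a sense of why the numbers of variables and constraints are the quantities compared between the two models.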

More From: RAIRO - Operations Research

Similar Papers

Randomized Objective Function Linear Programming in Risk Management
Dennis Ridley ... Inger Daniels
Journal of Applied Mathematics and Physics | VOL. 09
01 Jan 2020

Study on Linear Programming in Risk Management
Dennis Ridley ... Abdullah Khan
20 Apr 2022

Timber harvest scheduling in a fuzzy decision environment
B Bruce Bare ... Guillermo A Mendoza
Canadian Journal of Forest Research | VOL. 22
01 Apr 1992

An Application of LFP Method for Sintering Ore Ratio
Xi Cheng ... Yunfeng Ma
01 Jan 2009