Abstract

In this paper we consider constrained optimization of discrete-time Markov Decision Processes (MDPs) with finite state and action spaces, which accumulate both a reward and costs at each decision epoch. We study the problem of finding a policy that maximizes the expected total discounted reward subject to the constraints that the expected total discounted costs do not exceed given values. To this end, we investigate a decomposition method that partitions the state space into strongly communicating classes in order to compute an optimal or a nearly optimal stationary policy. The discounted criterion has many applications in areas such as forest management, energy consumption management, finance, communication systems (mobile networks), and artificial intelligence.
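Stated precisely, the problem is the standard constrained discounted formulation sketched below; the symbols (policy π, discount factor α, costs c_k with bounds C_k, initial state s₀) follow common usage for constrained MDPs and are an assumption about, not a quotation of, the paper's notation:

```latex
\max_{\pi}\; \mathbb{E}^{\pi}_{s_0}\!\left[\sum_{t=0}^{\infty} \alpha^{t}\, r(s_t, a_t)\right]
\quad \text{subject to} \quad
\mathbb{E}^{\pi}_{s_0}\!\left[\sum_{t=0}^{\infty} \alpha^{t}\, c_k(s_t, a_t)\right] \le C_k,
\qquad k = 1, \dots, K
```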

Highlights

  • The decomposition method consists in dividing the state space into subsets which are weakly coupled

  • In this work we model the environment as a Constrained Markov Decision Process, defined by a tuple (S, A, P, r, c, C, α, s₀), where S is the set of states, A is the set of actions, P(s′ | s, a) is the transition probability, r(s, a) is the reward function denoting the immediate reward incurred by taking action a in state s, c(s, a) is the cost function upper bounded by the cost constraint C, α is the discount factor, and s₀ is the fixed initial state (see the sketch after this list)

  • We solve the problem of constrained discounted Markov Decision Processes in steps, exploiting the decomposition of the state space into strongly communicating classes
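As a concrete illustration of this tuple, here is a minimal sketch in Python: it packs (S, A, P, r, c, C, α, s₀) into a small container and solves the constrained discounted problem via the classical occupation-measure linear program, not via the paper's decomposition method. The names (CMDP, solve_cmdp_lp) and the use of scipy are illustrative assumptions.

```python
# Minimal sketch (not the paper's algorithm): a constrained discounted MDP
# solved through the classical occupation-measure linear program.
from dataclasses import dataclass
import numpy as np
from scipy.optimize import linprog


@dataclass
class CMDP:
    P: np.ndarray      # transition probabilities, shape (S, A, S)
    r: np.ndarray      # rewards, shape (S, A)
    c: np.ndarray      # costs, shape (K, S, A)
    C: np.ndarray      # cost bounds, shape (K,)
    alpha: float       # discount factor in (0, 1)
    s0: int            # fixed initial state


def solve_cmdp_lp(m: CMDP):
    """Maximize discounted reward subject to discounted cost constraints.

    Variables are discounted occupation measures rho(s, a) >= 0 with flow
    constraints  sum_a rho(s', a) - alpha * sum_{s,a} P[s,a,s'] rho(s,a)
    = 1{s' = s0}.  A stationary (possibly randomized) optimal policy is
    recovered by normalizing rho over actions.
    """
    S, A = m.r.shape
    n = S * A
    # Bellman-flow equality constraints, one row per state s'.
    A_eq = np.zeros((S, n))
    for s in range(S):
        for a in range(A):
            col = s * A + a
            A_eq[s, col] += 1.0                      # outflow from (s, a)
            A_eq[:, col] -= m.alpha * m.P[s, a, :]   # discounted inflow
    b_eq = np.zeros(S)
    b_eq[m.s0] = 1.0
    # Cost constraints: sum_{s,a} rho(s,a) c_k(s,a) <= C_k.
    A_ub = m.c.reshape(len(m.C), n)
    # linprog minimizes, so negate the reward objective.
    res = linprog(-m.r.reshape(n), A_ub=A_ub, b_ub=m.C,
                  A_eq=A_eq, b_eq=b_eq, bounds=[(0, None)] * n)
    assert res.success, res.message
    rho = res.x.reshape(S, A)
    policy = rho / np.maximum(rho.sum(axis=1, keepdims=True), 1e-12)
    return policy, -res.fun
```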


Summary

Introduction

The decomposition method consists in dividing the state space into subsets which are weakly coupled. This technique was first introduced by Bather [1]. Subsequently, Ross and Varadarajan [5] presented a similar decomposition method to solve the constrained long-run average Markov Decision Process problem. In this decomposition, the state space is partitioned into strongly communicating classes and a (possibly empty) set of transient states. We solve the problem of constrained discounted Markov Decision Processes in steps, exploiting this decomposition of the state space into strongly communicating classes, as sketched below.
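As a rough illustration of the partitioning step, the sketch below groups states into the strongly connected components (SCCs) of the one-step reachability graph (an edge s → s′ whenever some action reaches s′ with positive probability). This is a simplification: SCCs give the communicating classes, while the strongly communicating classes of Ross and Varadarajan [5] additionally require a stationary policy under which the class is recurrent, so states grouped here may fail that stronger test. The function name and the SCC shortcut are assumptions for illustration, not the paper's algorithm.

```python
# Simplified sketch of the partitioning step: communicating classes via SCCs.
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components


def communicating_classes(P: np.ndarray):
    """P has shape (S, A, S); returns (labels, transient_states).

    States s, s' are grouped together when each is reachable from the
    other using some sequence of actions with positive probability.
    """
    S = P.shape[0]
    # Adjacency: edge s -> s' if some action moves s to s' with prob > 0.
    adj = (P.max(axis=1) > 0).astype(int)
    _, labels = connected_components(csr_matrix(adj),
                                     directed=True, connection='strong')
    # A singleton class with no action keeping the state in place must
    # eventually leave and can never return, so it is transient under
    # every policy.
    transient = [s for s in range(S)
                 if (labels == labels[s]).sum() == 1 and adj[s, s] == 0]
    return labels, transient


# Example: 3 states, 2 actions; states 0 and 1 swap, state 2 absorbs.
P = np.zeros((3, 2, 3))
P[0, 0, 1] = 1.0; P[0, 1, 2] = 1.0   # from 0: go to 1, or escape to 2
P[1, 0, 0] = 1.0; P[1, 1, 2] = 1.0   # from 1: go to 0, or escape to 2
P[2, :, 2] = 1.0                      # state 2 absorbs
print(communicating_classes(P))      # {0, 1} one class, {2} another
```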

Preliminaries
Decomposition theory
While … for some … DO
Restricted MDPs
Intermediate MDP
An optimal policy for the original MDP
Conclusion