Finite Horizon Partially Observable Semi-Markov Decision Processes under Risk Probability Criteria

Xin Wen,Xianping Guo,Li Xia

doi:10.1016/j.orl.2024.107187

Abstract

This paper deals with a risk probability minimization problem for finite horizon partially observable semi-Markov decision processes, which are the fairly most general models for stochastic dynamic systems. In contrast to the expected discounted and average criteria, the optimality investigated in this paper is to minimize the probability that the accumulated rewards do not reach a prescribed profit level at the finite terminal stage. First, the state space is augmented as the joint conditional distribution of the current unobserved state and the remaining profit goal. We introduce a class of policies depending on observable histories and a class of Markov policies including observable process with the joint conditional distribution. Then under mild assumptions, we prove that the value function is the unique solution to the optimality equation for the probability criterion by using iteration techniques. The existence of (ϵ-)optimal Markov policy for this problem is established. Finally, we use a bandit problem with the probability criterion to demonstrate our main results in which an effective algorithm and the corresponding numerical calculation are given for the semi-Markov model. Moreover, for the case of reduction to the discrete-time Markov model, we derive a concise solution.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Finite Horizon Partially Observable Semi-Markov Decision Processes under Risk Probability Criteria

Abstract

Talk to us

Similar Papers

More From: Operations Research Letters

Lead the way for us

Similar Papers

A Note on the Stability of Monotone Markov Chains
Bar Light
Operations Research Letters | VOL. 57
Bar LightBar Light
01 Nov 2024
Operations Research Letters | VOL. 57

Finite Horizon Partially Observable Semi-Markov Decision Processes under Risk Probability Criteria
Xin Wen ... Li Xia
Operations Research Letters | VOL. 57
Xin Wen, et. al.Xin Wen ... Li Xia
01 Nov 2024
Operations Research Letters | VOL. 57

Assessing the accuracy of externalities prediction in a LCFS-PR M/G/1 queue under partial information
Royi Jacobovic ... Nikki Levering
Operations Research Letters | VOL. 57
Royi Jacobovic, et. al.Royi Jacobovic ... Nikki Levering
01 Nov 2024
Operations Research Letters | VOL. 57

Fast Algorithms for Maximizing the Minimum Eigenvalue in Fixed Dimension
Adam Brown ... Mohit Singh
Operations Research Letters | VOL. 57
Adam Brown, et. al.Adam Brown ... Mohit Singh
01 Nov 2024
Operations Research Letters | VOL. 57

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Finite Horizon Partially Observable Semi-Markov Decision Processes under Risk Probability Criteria

Abstract

Talk to us

Similar Papers

More From: Operations Research Letters