Abstract
The Partially Observable Markov Decision Process (POMDP) is a fundamental model for probabilistic planning in stochastic domains. More recently, the constrained POMDP and the chance-constrained POMDP have extended the model by allowing constraints to be placed on some aspects of the policy in addition to the objective function. Despite their expressive power, these models assume that every action takes a fixed duration, which limits their ability to capture real-world planning problems. In this work, we propose a unified model for the durative POMDP and its constrained extensions. First, we convert these extensions into an Integer Linear Programming (ILP) formulation, which can be solved with existing solvers from the ILP literature. Second, we provide a heuristic search approach that efficiently prunes the search space, guided by solving successive partial ILP programs. Third, we give a theoretical analysis of the problem: unlike unconstrained short-horizon POMDPs, whose constant-depth policies can be computed in polynomial time, the constrained extensions are NP-hard even with a planning horizon of two and non-negative rewards. To address this, we propose a Fully Polynomial Time Approximation Scheme (FPTAS) that computes (near-)optimal deterministic policies in polynomial time, with an approximation ratio among the best achievable in theory. Finally, evaluation results show that our approach empirically outperforms the state-of-the-art fixed-horizon chance-constrained POMDP solver.
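To give a concrete sense of the ILP route mentioned in the abstract, the following is a minimal, purely illustrative sketch, not the paper's actual formulation: a toy policy-selection problem with a single cost constraint, encoded as an ILP and handed to an off-the-shelf solver (PuLP with the bundled CBC backend here). All variable names, rewards, costs, and the budget are hypothetical.

```python
# Illustrative sketch only: a toy constrained policy-selection ILP in the
# spirit of the paper's approach. Numbers and structure are hypothetical.
import pulp

values = [4.0, 6.0, 5.0]   # hypothetical expected reward of each candidate policy
costs = [2.0, 5.0, 3.0]    # hypothetical expected cost of each candidate policy
budget = 4.0               # hypothetical cost bound (the constraint)

prob = pulp.LpProblem("constrained_policy_selection", pulp.LpMaximize)

# Binary variable x[i] = 1 iff candidate policy i is selected.
x = [pulp.LpVariable(f"x{i}", cat="Binary") for i in range(len(values))]

# Objective: maximize the expected reward of the chosen policy.
prob += pulp.lpSum(v * xi for v, xi in zip(values, x))
# Constraint: the expected cost must respect the budget.
prob += pulp.lpSum(c * xi for c, xi in zip(costs, x)) <= budget
# Exactly one policy is selected.
prob += pulp.lpSum(x) == 1

prob.solve(pulp.PULP_CBC_CMD(msg=False))
chosen = [i for i, xi in enumerate(x) if xi.value() == 1]
print("selected policy:", chosen, "objective:", pulp.value(prob.objective))
```

On this toy instance the solver picks policy 2 (reward 5.0, cost 3.0), since the higher-reward policy 1 violates the budget; the paper's actual formulation operates over policy trees of a durative POMDP rather than an explicit list of candidates.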