Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs

Manav Vora,Pranay Thangeda,Melkior Ornik,Michael N Grussing

doi:10.1109/lcsys.2023.3280080

Abstract

Partially Observable Markov Decision Processes (POMDPs) provide an efficient way to model real-world sequential decision making processes. Motivated by the problem of maintenance and inspection of a group of infrastructure components with independent dynamics, this letter presents an algorithm to find the optimal policy for a multi-component budget-constrained POMDP. We first introduce a budgeted-POMDP model (b-POMDP) which enables us to find the optimal policy for a POMDP while adhering to budget constraints. Next, we prove that the value function or maximal collected reward for a special class of b-POMDPs is a concave function of the budget for the finite horizon case. Our second contribution is an algorithm to calculate the optimal policy for a multi-component budget-constrained POMDP by finding the optimal budget split among the individual component POMDPs. The optimal budget split is posed as a welfare maximization problem and the solution is computed by exploiting the concavity of the value function. We illustrate the effectiveness of the proposed algorithm by proposing a maintenance and inspection policy for a group of real-world infrastructure components with different deterioration dynamics, inspection and maintenance costs. We show that the proposed algorithm vastly outperforms the policies currently used in practice.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs

Abstract

Talk to us

Similar Papers

More From: IEEE Control Systems Letters

Lead the way for us

Journal: IEEE Control Systems Letters	Publication Date: Jan 1, 2023
License type: CC BY 4.0

Similar Papers

A Bayesian game based adaptive fuzzy controller for multiagent POMDPs
Rajneesh Sharma ... Matthijs T J Spaan
-
Rajneesh Sharma, et. al.Rajneesh Sharma ... Matthijs T J Spaan
01 Jul 2010
01 Jul 2010

POMDP-based online target detection and recognition for autonomous UAVs
...
-
, et. al. ...
03 Sep 2014
03 Sep 2014

Task-Based Decomposition of Factored POMDPs
Guy Shani
IEEE Transactions on Cybernetics | VOL. 44
Guy ShaniGuy Shani
01 Feb 2014
IEEE Transactions on Cybernetics | VOL. 44

POMDPs for sustainable fishery management
...
-
, et. al. ...
01 Dec 2019
01 Dec 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs

Abstract

Talk to us

Similar Papers

More From: IEEE Control Systems Letters