Partially Observable Markov Decision Process Policy Research Articles

Civil and maritime engineering systems, among others, from bridges to offshore platforms and wind turbines, must be efficiently managed, as they are exposed to deterioration mechanisms throughout their operational life, such as fatigue and/or corrosion. Identifying optimal inspection and maintenance policies demands the solution of a complex sequential decision-making problem under uncertainty, with the main objective of efficiently controlling the risk associated with structural failures. Addressing this complexity, risk-based inspection planning methodologies, supported often by dynamic Bayesian networks, evaluate a set of pre-defined heuristic decision rules to reasonably simplify the decision problem. However, the resulting policies may be compromised by the limited space considered in the definition of the decision rules. Avoiding this limitation, Partially Observable Markov Decision Processes (POMDPs) provide a principled mathematical methodology for stochastic optimal control under uncertain action outcomes and observations, in which the optimal actions are prescribed as a function of the entire, dynamically updated, state probability distribution. In this paper, we combine dynamic Bayesian networks with POMDPs in a joint framework for optimal inspection and maintenance planning, and we provide the relevant formulation for developing both infinite and finite horizon POMDPs in a structural reliability context. The proposed methodology is implemented and tested for the case of a structural component subject to fatigue deterioration, demonstrating the capability of state-of-the-art point-based POMDP solvers of solving the underlying planning stochastic optimization problem. Within the numerical experiments, POMDP and heuristic-based policies are thoroughly compared, and results showcase that POMDPs achieve substantially lower costs as compared to their counterparts, even for traditional problem settings.

Read full abstract

Efficient integration of uncertain observations with decision-making optimization is key for prescribing informed intervention actions, able to preserve structural safety of deteriorating engineering systems. To this end, it is necessary that scheduling of inspection and monitoring strategies be objectively performed on the basis of their expected value-based gains that, among others, reflect quantitative metrics such as the Value of Information (VoI) and the Value of Structural Health Monitoring (VoSHM). In this work, we introduce and study the theoretical and computational foundations of the above metrics within the context of Partially Observable Markov Decision Processes (POMDPs), thus alluding to a broad class of decision-making problems of partially observable stochastic deteriorating environments that can be modeled as POMDPs. Step-wise and life-cycle VoI and VoSHM definitions are devised and their bounds are analyzed as per the properties stemming from the Bellman equation and the resulting optimal value function. It is shown that a POMDP policy inherently leverages the notion of VoI to guide observational actions in an optimal way at every decision step, and that the permanent or intermittent information provided by SHM or inspection visits, respectively, can only improve the cost of this policy in the long-term, something that is not necessarily true under locally optimal policies, typically adopted in decision-making of structures and infrastructure. POMDP solutions are derived based on point-based value iteration methods, and the various definitions are quantified in stationary and non-stationary deteriorating environments, with both infinite and finite planning horizons, featuring single- or multi-component engineering systems.

Read full abstract

Partially Observable Markov Decision Process Policy Research Articles

Related Topics

Articles published on Partially Observable Markov Decision Process Policy

Strong Simple Policies for POMDPs

Risk-Averse Decision Making Under Uncertainty

Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs

Inspection and maintenance planning for offshore wind structural components: integrating fatigue failure criteria with Bayesian networks and Markov decision processes

Task-Aware Verifiable RNN-Based Policies for Partially Observable Markov Decision Processes

Optimal inspection and maintenance planning for deteriorating structural components through dynamic Bayesian networks and Markov decision processes

Value of structural health information in partially observable stochastic environments

POMHDP: Search-Based Belief Space Planning Using Multiple Heuristics

Observation-Based Optimization for POMDPs With Continuous State, Observation, and Action Spaces

Extending the Applicability of POMDP Solutions to Robotic Tasks

Decision-theoretic planning under uncertainty with information rewards for active cooperative perception

Energy Efficient Execution of POMDP Policies.

Dialogue POMDP components (part I): learning states and observations

Dialogue POMDP components (Part II): learning the reward function

Decentralized Multi-Robot Cooperation with Auctioned POMDPs

A Novel Point-Based Incremental Pruning Algorithm for POMDP

Task-Based Decomposition of Factored POMDPs

A Conceptual-operative Framework for in-process Decision Support of Software Project Management Practice

Stochastic spectrum handoff protocols for partially observable cognitive radio networks

Decentralized multi-robot cooperation with auctioned POMDPs

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Partially Observable Markov Decision Process Policy Research Articles

Related Topics

Articles published on Partially Observable Markov Decision Process Policy

Strong Simple Policies for POMDPs

Risk-Averse Decision Making Under Uncertainty

Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs

Inspection and maintenance planning for offshore wind structural components: integrating fatigue failure criteria with Bayesian networks and Markov decision processes

Task-Aware Verifiable RNN-Based Policies for Partially Observable Markov Decision Processes

Optimal inspection and maintenance planning for deteriorating structural components through dynamic Bayesian networks and Markov decision processes

Value of structural health information in partially observable stochastic environments

POMHDP: Search-Based Belief Space Planning Using Multiple Heuristics

Observation-Based Optimization for POMDPs With Continuous State, Observation, and Action Spaces

Extending the Applicability of POMDP Solutions to Robotic Tasks

Decision-theoretic planning under uncertainty with information rewards for active cooperative perception

Energy Efficient Execution of POMDP Policies.

Dialogue POMDP components (part I): learning states and observations

Dialogue POMDP components (Part II): learning the reward function

Decentralized Multi-Robot Cooperation with Auctioned POMDPs

A Novel Point-Based Incremental Pruning Algorithm for POMDP

Task-Based Decomposition of Factored POMDPs

A Conceptual-operative Framework for in-process Decision Support of Software Project Management Practice

Stochastic spectrum handoff protocols for partially observable cognitive radio networks

Decentralized multi-robot cooperation with auctioned POMDPs