Abstract

Decision-making for engineering systems management can be efficiently formulated using Markov Decision Processes (MDPs) or Partially Observable MDPs (POMDPs). Typical MDP/POMDP solution procedures utilize offline knowledge about the environment and provide detailed policies for relatively small systems with tractable state and action spaces. However, in large multi-component systems the dimensions of these spaces easily explode, as system states and actions scale exponentially with the number of components, whereas environment dynamics are difficult to describe explicitly for the entire system and may often be accessible only through computationally expensive numerical simulators. In this work, to address these issues, an integrated Deep Reinforcement Learning (DRL) framework is introduced. The Deep Centralized Multi-agent Actor Critic (DCMAC) is developed: an off-policy actor-critic DRL algorithm that directly probes the state/belief space of the underlying MDP/POMDP, providing efficient life-cycle policies for large multi-component systems operating in high-dimensional spaces. Apart from deep network approximators parametrizing complex functions over vast state spaces, DCMAC also adopts a factorized representation of the system actions, thus being able to designate individualized component- and subsystem-level decisions, while maintaining a centralized value function for the entire system. DCMAC compares well against Deep Q-Network and exact solutions, where applicable, and outperforms optimized baseline policies that incorporate time-based, condition-based, and periodic inspection and maintenance considerations.
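To make the factorized-actor/centralized-critic idea concrete, the following is a minimal illustrative sketch (not the authors' implementation): each component receives its own action head over the shared system state/belief encoding, so the policy output grows linearly rather than exponentially with the number of components, while a single critic values the whole system. All names, layer sizes, and dimensions are hypothetical.

```python
# Hedged sketch of a factorized actor with a centralized critic (PyTorch).
# Architecture details here are illustrative assumptions, not the DCMAC code.
import torch
import torch.nn as nn

class FactorizedActorCritic(nn.Module):
    def __init__(self, state_dim, n_components, actions_per_component, hidden=128):
        super().__init__()
        # Shared encoder over the full system state/belief vector.
        self.encoder = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        # Factorized actor: one categorical head per component, so the number of
        # policy outputs scales linearly with the number of components.
        self.actor_heads = nn.ModuleList(
            [nn.Linear(hidden, actions_per_component) for _ in range(n_components)]
        )
        # Centralized critic: a single value estimate for the entire system.
        self.critic = nn.Linear(hidden, 1)

    def forward(self, state):
        h = self.encoder(state)
        dists = [torch.distributions.Categorical(logits=head(h))
                 for head in self.actor_heads]
        value = self.critic(h)
        return dists, value

# Usage example: sample component-level actions for a hypothetical 10-component system.
model = FactorizedActorCritic(state_dim=40, n_components=10, actions_per_component=4)
belief = torch.rand(1, 40)                                   # placeholder belief/state vector
dists, value = model(belief)
actions = torch.stack([d.sample() for d in dists], dim=-1)   # one action per component
```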
