Guided probabilistic reinforcement learning for sampling-efficient maintenance scheduling of multi-component system

Yiming Zhang, Dingyang Zhang, Xiaoge Zhang, Lemiao Qiu, Felix T.S. Chan, Zili Wang, Shuyou Zhang

doi:10.1016/j.apm.2023.03.025

Abstract

In recent years, multi-agent deep reinforcement learning has progressed rapidly, as reflected by its increasing adoption in industrial applications. This paper proposes a Guided Probabilistic Reinforcement Learning (Guided-PRL) model to tackle maintenance scheduling of multi-component systems under uncertainty, with the goal of minimizing the overall life-cycle cost. The proposed Guided-PRL builds on the Actor-Critic (AC) scheme. Because traditional AC is sample-inefficient and prone to getting stuck in local minima in multi-agent reinforcement learning, it is challenging for the actor network to converge to a solution of desirable quality even when the critic network is properly configured. To address these issues, we develop a generic framework to facilitate effective training of the actor network. The framework consists of environmental reward modeling, degradation formulation, state representation, and policy optimization. The convergence speed of the actor network is significantly improved by a guided sampling scheme for environment exploration that exploits rule-based domain expert policies. To handle data scarcity, the environmental modeling and policy optimization are approximated with Bayesian models for effective uncertainty quantification. The Guided-PRL model is evaluated on simulations of a 12-component system as well as GE90 and CFM56 engines. Compared with four alternative deep reinforcement learning schemes, the Guided-PRL lowers life-cycle cost by 34.92% to 88.07%. Compared with rule-based expert policies, the Guided-PRL decreases the life-cycle cost by 23.26% to 51.36%.
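The idea of guiding exploration with expert rules can be illustrated with a minimal Python sketch. It is not the authors' implementation: the threshold rule, the random stand-in for the actor, the decay schedule, and every name and parameter value below (`expert_policy`, `actor_policy`, `guided_action`, `guide_schedule`, `threshold`, `start`, `decay`, `floor`) are assumptions made only for illustration. The sketch follows an expert rule with some probability and otherwise defers to the actor, and it lowers that probability as training proceeds.

```python
import random


def expert_policy(degradation, threshold=0.7):
    """Hypothetical expert rule: replace (1) a component whose degradation
    has reached the threshold, otherwise do nothing (0)."""
    return [1 if d >= threshold else 0 for d in degradation]


def actor_policy(degradation, rng):
    """Stand-in for a learned actor network: one random binary action
    per component. A real actor would condition on the state."""
    return [rng.randint(0, 1) for _ in degradation]


def guided_action(degradation, guide_prob, rng):
    """Follow the expert rule with probability guide_prob, else the actor.
    Returns the action vector and the name of the policy that produced it."""
    if rng.random() < guide_prob:
        return expert_policy(degradation), "expert"
    return actor_policy(degradation, rng), "actor"


def guide_schedule(episode, start=0.9, decay=0.95, floor=0.05):
    """Assumed geometric decay of the guidance probability, clipped at a floor."""
    return max(floor, start * decay ** episode)


if __name__ == "__main__":
    rng = random.Random(0)
    state = [rng.random() for _ in range(12)]  # 12 components, toy degradation levels
    for episode in (0, 10, 50):
        action, source = guided_action(state, guide_schedule(episode), rng)
        print(episode, source, action)
```

Early episodes mostly replay the expert rule, which gives the learner informative samples from the start; later episodes rely increasingly on the actor's own choices. How the paper actually combines expert policies with sampling is described in the full text, not in this sketch.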

Journal: Applied Mathematical Modelling
Publication Date: Mar 22, 2023
Citations: 5

