The study presents a reinforcement learning (RL) method called the Q-learning algorithm to determine the best maintenance policy for equipment. This involves an artificial intelligence agent making decisions and an environment representing the equipment. The agent creates maintenance policies and takes actions, while the environment determines state transitions and rewards based on the actions chosen. The optimization of the maintenance policy starts with predicting equipment reliability using sensor data. This prediction method combines Back Propagation Neural Network (BPNN) algorithms with the Boxing Match Algorithm (BMA), an evolutionary meta-heuristic algorithm. A novel weight update strategy enhances the performance of artificial neural networks in reliability prediction. This integrated model, BMA-BPNN, aims to improve the accuracy of forecasted reliability. The study involves forecasting equipment reliability to determine critical levels, incorporating cost and risk considerations into decision-making for optimizing maintenance policies. As a result, the Q-learning algorithm is used to identify the best maintenance actions based on equipment reliability. Implementing an automated maintenance system that considers equipment reliability and costs can help reduce accidents resulting from maintenance program deficiencies. This study thus contributes to the field by providing an efficient approach to equipment maintenance.
Read full abstract