Abstract

In this study, we proposed a data-driven approach to the condition monitoring of the marine engine. Although several unsupervised methods in the maritime industry have existed, the common limitation was the interpretation of the anomaly; they do not explain why the model classifies specific data instances as an anomaly. This study combines explainable AI techniques with anomaly detection algorithm to overcome the limitation above. As an explainable AI method, this study adopts Shapley Additive exPlanations (SHAP), which is theoretically solid and compatible with any kind of machine learning algorithm. SHAP enables us to measure the marginal contribution of each sensor variable to an anomaly. Thus, one can easily specify which sensor is responsible for the specific anomaly. To illustrate our framework, the actual sensor stream obtained from the cargo vessel collected over 10 months was analyzed. In this analysis, we performed hierarchical clustering analysis with transformed SHAP values to interpret and group common anomaly patterns. We showed that anomaly interpretation and segmentation using SHAP value provides more useful interpretation compared to the case without using SHAP value.

Highlights

  • Modern vessels utilize onboard sensors and data acquisition systems to collect ship performance and navigation parameters [1]

  • CoInncltuhsisiosntus dy, we proposed a data-driven approach to the condition monitoring of the mInartihnise setnugdiyn,ew

  • One can specify which sensor is responsible for the specific anomaly

Read more

Summary

Introduction

Modern vessels utilize onboard sensors and data acquisition systems to collect ship performance and navigation parameters [1]. Data-Driven Approach to Condition-Based Monitoring of Marine Engines. One of the promising data-driven approaches using vessel datasets is condition monitoring of the vessel main engine. Several researchers tried to analyze the essential sensor signals related to the main engine, such as power, speed, temperature, and pressures for selected engine components, to detect anomalous data points that may indicate critical information for the failure [15]. The ideal approach to data-driven condition monitoring would be to use the runto-failure data obtained from the actual engine failure event. If such data is available, a classification model can be directly applied to predict the failure from the sensor data. The dataset consists of 14 faulty conditions additional to normal behavior, and a machine learning model was adopted to discern failure types. Generating the simulated dataset is costly because it requires complex physical modeling of the system, which may be challenging to develop

Objectives
Findings
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call