Abstract

With the increasing population of Industry 4.0, industrial big data (IBD) has become a hotly discussed topic in digital and intelligent industry field. The security problem existing in the signal processing on large scale of data stream is still a challenge issue in industrial internet of things, especially when dealing with the high-dimensional anomaly detection for intelligent industrial application. In this article, to mitigate the inconsistency between dimensionality reduction and feature retention in imbalanced IBD, we propose a variational long short-term memory (VLSTM) learning model for intelligent anomaly detection based on reconstructed feature representation. An encoder-decoder neural network associated with a variational reparameterization scheme is designed to learn the low-dimensional feature representation from high-dimensional raw data. Three loss functions are defined and quantified to constrain the reconstructed hidden variable into a more explicit and meaningful form. A lightweight estimation network is then fed with the refined feature representation to identify anomalies in IBD. Experiments using a public IBD dataset named UNSW-NB15 demonstrate that the proposed VLSTM model can efficiently cope with imbalance and high-dimensional issues, and significantly improve the accuracy and reduce the false rate in anomaly detection for IBD according to F1, area under curve (AUC), and false alarm rate (FAR).

Highlights

  • With the rapid development of Industry 4.0, more and more industrial applications, empowered by intelligent and real-time signal processing, are connected interactively, due to the increasingly wide use of wireless network technology with diversified smart devices in Industrial Internet of Things (IIoT)

  • The six conventional learning methods may perform well on the validation data, the highest results of F1, False Alarm Rate (FAR), and AUC shown in Table 3 demonstrate the efficiency of our Variational Long Short-Term Memory (VLSTM) method in improving the accuracy of classification tasks, and reducing the false rate of anomaly detections for the imbalanced and high-dimensional data in Industrial Big Data (IBD) environments

  • A VLSTM learning model was designed to cope with the imbalance and high-dimensional issues, which could be applied for intelligent anomaly detection based on reconstructed feature representation in IBD environments

Read more

Summary

INTRODUCTION

With the rapid development of Industry 4.0, more and more industrial applications, empowered by intelligent and real-time signal processing, are connected interactively, due to the increasingly wide use of wireless network technology with diversified smart devices in Industrial Internet of Things (IIoT). It would be even worse for conventional classification methods to extract meaningful features from the imbalanced input data, especially when positive samples become extremely sparse in IBD environments. Different kinds of auto encoding techniques have been explored in intrusion detection and achieved great success in re-encoding high-dimensional features to lower dimension features [3] These methods could improve the accuracy of anomaly detection to a certain extent, the low False Alarm Rate (FAR) is still an unsolved issue especially when facing the imbalanced dataset. A novel anomaly detection model based on Variational Long Short-Term Memory (VLSTM) is designed to deal with the imbalanced and high-dimensional issues in IBD.

Issues on Intrusion Detection System
Machine Learning in Anomaly Detection
MODELING OF VLSTM
VLSTM Framework
Hidden Variable Reconstruction via Variational Bayes
Robust Constraint for Hidden Variable
VLSTM Enhanced Anomaly Detection Algorithm
7: Input Z into the LSTM decoder to get the reconstructed
Data Set
Experiment Design and Evaluation Metrics
Evaluation on Reparameterization Effectiveness
Analysis on Anomaly Detection Performance
Method
Discussion
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.