Abstract

Software applications can exhibit intrinsic variability in their execution time due to interference from other applications or contention from other users, which may lead to unexpectedly long running times and anomalous performance. There is thus a need for effective automated performance anomaly detection methods that can be used within production environments to avoid late detection of unexpected degradations of service level. To address this challenge, we introduce TRACK-Plus, a black-box training methodology for performance anomaly detection. The method combines an artificial neural network-driven approach with Bayesian Optimization to identify anomalous performance. TRACK-Plus has been extensively validated on a real Apache Spark Streaming system, achieving a high F-score while simultaneously reducing training time by 80%, allowing anomalies to be detected efficiently.
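The abstract does not reproduce the TRACK-Plus network itself; purely as a hypothetical illustration of the kind of black-box detector it trains, the sketch below fits a small feed-forward neural network to labelled performance samples. The metric names, layer sizes, and synthetic data are assumptions for illustration only, not the paper's configuration.

```python
# Hypothetical sketch: a small feed-forward network that labels performance
# samples (e.g. per-batch Spark Streaming metrics) as normal or anomalous.
# Feature names, layer sizes, and the synthetic data are illustrative only.
import numpy as np
from sklearn.neural_network import MLPClassifier
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Features per sample: [cpu_utilisation, processing_delay_ms, scheduling_delay_ms]
normal = rng.normal([0.45, 800, 30], [0.05, 100, 10], size=(500, 3))
anomalous = rng.normal([0.95, 2500, 400], [0.03, 300, 80], size=(50, 3))
X = np.vstack([normal, anomalous])
y = np.array([0] * len(normal) + [1] * len(anomalous))  # 1 = performance anomaly

scaler = StandardScaler().fit(X)
model = MLPClassifier(hidden_layer_sizes=(16, 8), max_iter=500, random_state=0)
model.fit(scaler.transform(X), y)

# Score a new window of metrics collected from the running system.
window = np.array([[0.97, 2700.0, 450.0]])
print("anomaly" if model.predict(scaler.transform(window))[0] else "normal")
```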

Highlights

  • In-memory processing technologies for Big Data have been widely adopted in industry; in particular, Apache Spark has drawn attention because of its speed, generality, and ease of use

  • We describe TRACK and TRACK-Plus, two methods to efficiently train a class of machine learning models for performance anomaly detection using a fixed number of experiments (see the fixed-budget search sketch after this list)

  • Some performance anomaly identification studies and surveys have been conducted in the literature for different purposes [14], [17], [23], [24]; there is still a shortage of studies that propose efficient automated anomaly detection, especially for the in-memory Big Data stream processing technologies we study
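TRACK and TRACK-Plus themselves are not reproduced here; purely as an illustration of tuning a detector under a fixed experiment budget, the sketch below uses scikit-optimize's gp_minimize (an assumption; the paper's own optimizer and search space may differ) to choose a training-set size and a workload parameter within a fixed number of evaluations.

```python
# Illustrative fixed-budget Bayesian Optimization over training configuration.
# The search space (training-set size, batch interval) and the toy objective
# are assumptions; a real pipeline would train and validate the detector here.
import math
from skopt import gp_minimize
from skopt.space import Integer

space = [
    Integer(1_000, 50_000, name="training_samples"),
    Integer(1, 30, name="batch_interval_s"),
]

def run_training_experiment(training_samples, batch_interval_s):
    # Placeholder standing in for one real train/validate cycle: returns a
    # synthetic F-score that improves with more training data up to a point.
    return 0.6 + 0.35 * (1 - math.exp(-training_samples / 10_000)) - 0.002 * batch_interval_s

def objective(params):
    training_samples, batch_interval_s = params
    f_score = run_training_experiment(training_samples, batch_interval_s)
    return 1.0 - f_score  # minimise (1 - F-score)

# n_calls is the fixed number of experiments the methodology allows.
result = gp_minimize(objective, space, n_calls=20, random_state=0)
print("best configuration:", result.x, "best loss:", result.fun)
```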


Summary

Introduction

In-memory processing technologies for Big Data have been widely adopted in industry; in particular, Apache Spark has drawn attention because of its speed, generality, and ease of use. The baseline experiment shows that the neural network model fails to detect CPU anomalies when the streaming workload configuration is changed: the model requires additional training covering more possible configuration parameters before it can detect such anomalies. This demonstrates the critical need for a solution that finds the optimal dataset size and configuration parameters of a streaming workload for training the anomaly detection model within an in-memory Big Data system, so that the model generalizes. The final output data from Spark Streaming can be pushed out to databases or other systems [43].
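As a minimal sketch of that last point (pushing Spark Streaming output to an external store), the snippet below writes each micro-batch of a Structured Streaming query to a JDBC table. The rate source, connection URL, and table name are placeholders, and the Structured Streaming API is used as one common option; a DStream-based pipeline such as the paper's would use foreachRDD analogously.

```python
# Minimal sketch: push Spark Structured Streaming output to a database via
# foreachBatch. The rate source, JDBC URL, and table name are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("stream-to-db").getOrCreate()

# Toy streaming source; replace with Kafka, sockets, files, etc.
stream_df = spark.readStream.format("rate").option("rowsPerSecond", 10).load()

def write_batch(batch_df, batch_id):
    # Each micro-batch is written out with the ordinary batch JDBC writer.
    (batch_df.write
        .format("jdbc")
        .option("url", "jdbc:postgresql://localhost:5432/metrics")  # placeholder
        .option("dbtable", "stream_output")                         # placeholder
        .option("user", "spark")
        .option("password", "spark")
        .mode("append")
        .save())

query = stream_df.writeStream.foreachBatch(write_batch).start()
query.awaitTermination()
```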
