Abstract

AbstractIn present times, data science become popular to support and improve decision-making process. Due to the accessibility of a wide application perspective of data streaming, class imbalance and concept drifting become crucial learning problems. The advent of deep learning (DL) models finds useful for the classification of concept drift in data streaming applications. This paper presents an effective class imbalance with concept drift detection (CIDD) using Adadelta optimizer-based deep neural networks (ADODNN), named CIDD-ADODNN model for the classification of highly imbalanced streaming data. The presented model involves four processes namely preprocessing, class imbalance handling, concept drift detection, and classification. The proposed model uses adaptive synthetic (ADASYN) technique for handling class imbalance data, which utilizes a weighted distribution for diverse minority class examples based on the level of difficulty in learning. Next, a drift detection technique called adaptive sliding window (ADWIN) is employed to detect the existence of the concept drift. Besides, ADODNN model is utilized for the classification processes. For increasing the classifier performance of the DNN model, ADO-based hyperparameter tuning process takes place to determine the optimal parameters of the DNN model. The performance of the presented model is evaluated using three streaming datasets namely intrusion detection (NSL KDDCup) dataset, Spam dataset, and Chess dataset. A detailed comparative results analysis takes place and the simulation results verified the superior performance of the presented model by obtaining a maximum accuracy of 0.9592, 0.9320, and 0.7646 on the applied KDDCup, Spam, and Chess dataset, respectively.

Highlights

  • With the progressive technical advancements, numerous data streams are produced robustly in recent times

  • The concept drift detection is a strategy while the changes in data distribution make recent prediction method as inaccurate

  • Adadelta optimizer-based deep neural networks (ADODNN) model is utilized for the classification processes

Read more

Summary

Introduction

With the progressive technical advancements, numerous data streams are produced robustly in recent times. Sudden concept drift is represented by the massive changes from basic class distribution as well as the incoming samples within the time duration. Once the sampling process is applied, the two class problems are constant in stream data classification. Assume the online ensemble classifier Θ which receives novel instance xt at t time, and detected class label is y′t. Because of the imbalance in distribution of samples between majority and minority class instances, the classifier performance gets degraded. The concept drift detection is a strategy while the changes in data distribution make recent prediction method as inaccurate. The stream data classifier with no concept drift adaptation is not desirable to classify imbalance class distribution. This paper designs a novel class imbalance with concept drift detection (CIDD) using Adadelta optimizer-based deep neural networks (ADODNN), named CIDD-ADODNN model to classify highly imbalance data streams. Validate the performance of the CIDD-ADODNN model, three streaming datasets

Literature survey
Procedure
Methods
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call