Dynamically Adjusting Diversity in Ensembles for the Classification of Data Streams with Concept Drift

Juan I G Hidalgo, Silas G T C Santos, Roberto S M Barros

doi:10.1145/3466616

Abstract

A data stream can be defined as a system that continually generates large volumes of data over time. Today, processing data streams poses new demands and challenging tasks in the data mining and machine learning areas. Concept drift is a problem commonly characterized as changes in the distribution of the data within a data stream. Implementing new methods for data streams where concept drifts occur requires algorithms that can adapt to several scenarios, so that their performance improves across the different experimental situations in which they are tested. This research proposes a strategy for dynamic parameter adjustment in the presence of concept drifts. The Parameter Estimation Procedure (PEP) is a general method for dynamically adjusting parameters, applied here to the diversity parameter (λ) of several classification ensembles commonly used in the area. To this end, PEP was used to create Boosting-like Online Learning Ensemble with Parameter Estimation (BOLE-PE), Online AdaBoost-based M1 with Parameter Estimation (OABM1-PE), and Oza and Russell’s Online Bagging with Parameter Estimation (OzaBag-PE), based on the existing ensembles BOLE, OABM1, and OzaBag, respectively. To validate them, experiments were performed with artificial and real-world datasets using Hoeffding Tree (HT) as the base classifier. The accuracy results were statistically evaluated using a variation of the Friedman test and the Nemenyi post-hoc test. The experimental results showed that dynamically estimating the diversity parameter (λ) produced good results in most scenarios, i.e., the modified methods improved accuracy in the experiments with both artificial and real-world datasets.
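To make the role of the diversity parameter concrete, here is a minimal sketch of online bagging in the style of Oza and Russell, assuming (the abstract does not spell this out) that λ is the mean of the Poisson distribution that decides how many times each ensemble member trains on an incoming instance. The names `poisson`, `MajorityClass`, and `OnlineBagging` are hypothetical, the toy base learner is only a stand-in for a Hoeffding Tree, and the sketch does not implement PEP itself.

```python
import math
import random
from collections import Counter


def poisson(lam, rng):
    """Draw k ~ Poisson(lam) with Knuth's multiplication method."""
    threshold = math.exp(-lam)
    k, product = 0, rng.random()
    while product > threshold:
        k += 1
        product *= rng.random()
    return k


class MajorityClass:
    """Toy base learner (stand-in for a Hoeffding Tree, an assumption)."""

    def __init__(self):
        self.counts = Counter()

    def learn(self, x, y, weight=1):
        # Training 'weight' times on (x, y) just adds to the label count.
        self.counts[y] += weight

    def predict(self, x):
        return self.counts.most_common(1)[0][0] if self.counts else None


class OnlineBagging:
    """Online bagging where each member sees an instance k ~ Poisson(lam) times."""

    def __init__(self, n_members, lam=1.0, seed=0):
        self.members = [MajorityClass() for _ in range(n_members)]
        self.lam = lam  # diversity parameter; may be changed between instances
        self.rng = random.Random(seed)

    def learn(self, x, y):
        for member in self.members:
            k = poisson(self.lam, self.rng)
            if k > 0:  # k == 0 means this member skips the instance
                member.learn(x, y, weight=k)

    def predict(self, x):
        # Majority vote over members that have seen at least one instance.
        votes = Counter(member.predict(x) for member in self.members)
        votes.pop(None, None)
        return votes.most_common(1)[0][0] if votes else None


ensemble = OnlineBagging(n_members=10, lam=1.0, seed=42)
for i in range(50):
    ensemble.learn({"feature": i}, "a")
prediction = ensemble.predict({"feature": 0})
```

Under this assumption, a member skips an instance with probability exp(-λ), so a smaller λ makes the members train on more dissimilar samples of the stream. A dynamic procedure such as PEP would update `lam` while the stream is processed; how it chooses the new value is not described in the abstract and is not sketched here.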

Journal: ACM Transactions on Knowledge Discovery from Data
Publication Date: Jul 21, 2021
Citations: 8

Similar Papers

Using Diversity Ensembles with Time Limits to Handle Concept Drift
Robert Van Camp
20 Dec 2016

One-class classifiers with incremental learning and forgetting for data streams with concept drift
Bartosz Krawczyk ... Michał Woźniak
Soft Computing | VOL. 19
21 Oct 2014

An ensemble method for data stream classification in the presence of concept drift
Omid Abbaszadeh ... Ali Amiri
Frontiers of Information Technology & Electronic Engineering | VOL. 16
01 Dec 2015

Comparative study of Fast Stacking Ensembles families algorithms
...
15 Oct 2020
