Abstract

How to efficiently uncover the knowledge hidden within massive data remains an open problem. One of the challenges is 'concept drift' in streaming data flows. Concept drift is a well-known problem in data analytics, in which the statistical properties of the attributes and their target classes shift over time, making the trained model less accurate. Many methods have been proposed for data mining in batch mode. Stream mining represents a new generation of data mining techniques, in which the model is updated in one pass whenever new data arrive. This one-pass mechanism is inherently adaptive and hence potentially more robust than its predecessors in handling concept drift in data streams. In this paper, we evaluate the performance of a family of decision-tree-based data stream mining algorithms. The advantage of incremental decision tree learning is that a set of rules can be extracted from the induced model. The extracted rules, in the form of predicate logic, can subsequently be used in many decision-support applications. However, the induced decision tree must be both accurate and compact, even in the presence of concept drift. We compare the performance of three typical incremental decision tree algorithms (VFDT [2], ADWIN [3], iOVFDT [4]) on concept-drift data. Both synthetic and real-world drift data are used in the experiments. iOVFDT is found to produce superior results.
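
To make the one-pass mechanism concrete, the sketch below shows a minimal, simplified incremental learner in the spirit of Hoeffding-bound-based algorithms such as VFDT: each arriving example is processed exactly once to update sufficient statistics, and a split is committed only when the Hoeffding bound indicates the best attribute is reliably best. This is not the authors' implementation of VFDT, ADWIN, or iOVFDT; the class and method names (StreamingStump, learn_one, predict_one) and the assumptions (binary features, binary labels, a single split level, range R = 1 in the bound) are illustrative only.

```python
# Conceptual sketch of one-pass incremental learning with a Hoeffding-bound
# split test. Illustrative only; not taken from the cited papers.
import math
from collections import defaultdict


class StreamingStump:
    """A one-level decision tree grown from a stream, one example at a time."""

    def __init__(self, n_features, delta=1e-6):
        self.n_features = n_features
        self.delta = delta            # confidence parameter of the Hoeffding bound
        self.split_feature = None     # chosen split attribute, None while still a leaf
        self.n_seen = 0
        # counts[f][feature_value][label] -> number of examples observed
        self.counts = [defaultdict(lambda: defaultdict(int)) for _ in range(n_features)]
        self.class_counts = defaultdict(int)

    def learn_one(self, x, y):
        """Update sufficient statistics with a single (x, y) example (one pass)."""
        self.n_seen += 1
        self.class_counts[y] += 1
        for f in range(self.n_features):
            self.counts[f][x[f]][y] += 1
        if self.split_feature is None:
            self._try_split()

    def predict_one(self, x):
        """Predict the majority class, conditioned on the split if one was made."""
        if self.split_feature is None:
            return max(self.class_counts, key=self.class_counts.get, default=0)
        branch = self.counts[self.split_feature][x[self.split_feature]]
        return max(branch, key=branch.get, default=0)

    def _gini(self, label_counts):
        total = sum(label_counts.values())
        if total == 0:
            return 0.0
        return 1.0 - sum((c / total) ** 2 for c in label_counts.values())

    def _split_score(self, f):
        """Weighted Gini impurity after splitting on feature f (lower is better)."""
        return sum(
            (sum(label_counts.values()) / self.n_seen) * self._gini(label_counts)
            for label_counts in self.counts[f].values()
        )

    def _try_split(self):
        """Split only when the Hoeffding bound says the best attribute is reliably best."""
        if self.n_seen < 2:
            return
        ranked = sorted((self._split_score(f), f) for f in range(self.n_features))
        # Hoeffding bound (range R assumed 1): with probability 1 - delta, the
        # observed mean is within epsilon of the true mean after n_seen samples.
        epsilon = math.sqrt(math.log(1.0 / self.delta) / (2.0 * self.n_seen))
        if len(ranked) < 2 or ranked[1][0] - ranked[0][0] > epsilon:
            self.split_feature = ranked[0][1]


# One-pass processing: each example is seen exactly once as it arrives.
stump = StreamingStump(n_features=2)
stream = [((0, 1), 1), ((1, 0), 0), ((0, 0), 1), ((1, 1), 0)] * 100
for x, y in stream:
    stump.learn_one(x, y)
print("chosen split feature:", stump.split_feature)      # feature 0 predicts the label
print("prediction for (0, 1):", stump.predict_one((0, 1)))
```

Because the model is revised on every arriving example rather than retrained on a stored batch, statistics that reflect drifted data gradually dominate the split decisions, which is the adaptivity the abstract refers to; full algorithms such as ADWIN and iOVFDT add explicit drift detection and model-size control on top of this basic loop.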
