Concept Drifting Data Streams Research Articles

A supervised learning algorithm aims to build a prediction model using training examples. This paradigm typically has the assumptions that the underl ying distribution and the true input–output dependency do not change. However, these assumptions often fail to hold, especially in data streams. This phenomenon is known as concept drift.We propose a new model combining algorithm for tracking concept drift in data streams. The final predictive ensemble model has a form of a weighted average and ridge regression combiner. The coefficients of the combiner are determined by ridge regression with the constraints such that the coefficients are nonnegative and sum to 1. The proposed algorithm is devised via a new measure of concept drift, the angle between the estimated weights from data and the optimal weight vector obtained under no concept drift. It is shown that the ridge tuning parameter plays a crucial role of forcing the proposed algorithm to adapt to concept drift. Our main findings include (i) the proposed algorithm can achieve the optimal weights in the case of no concept drift if the tuning parameter is sufficiently large, and (ii) the angle is monotonically increasing as the tuning parameter decreases. These imply that if the tuning parameter is well-controlled, the algorithm can produce weights which reflect the degree of concept drift measured by the angle. Using various numerical examples, it is shown that the proposed algorithm can track concept drift better than other existing ensemble methods. Supplemental materials, computer code and R-package, are available online.

Read full abstract

AbstractAn established method to detect concept drift in data streams is to perform statistical hypothesis testing on the multivariate data in the stream. The statistical theory offers rank‐based statistics for this task. However, these statistics depend on a fixed set of characteristics of the underlying distribution. Thus, they work well whenever the change in the underlying distribution affects the properties measured by the statistic, but they perform not very well, if the drift influences the characteristics caught by the test statistic only to a small degree. To address this problem, we show how uniform convergence bounds in learning theory can be adjusted for adaptive concept drift detection. In particular, we present three novel drift detection tests, whose test statistics are dynamically adapted to match the actual data at hand. The first one is based on a rank statistic on density estimates for a binary representation of the data, the second compares average margins of a linear classifier induced by the 1‐norm support vector machine (SVM), and the last one is based on the average zero‐one, sigmoid or stepwise linear error rate of an SVM classifier. We compare these new approaches with the maximum mean discrepancy method, the StreamKrimp system, and the multivariate Wald–Wolfowitz test. The results indicate that the new methods are able to detect concept drift reliably and that they perform favorably in a precision‐recall analysis. Copyright © 2009 Wiley Periodicals, Inc. Statistical Analysis and Data Mining 2: 311‐327, 2009

Read full abstract

Concept Drifting Data Streams Research Articles

Related Topics

Articles published on Concept Drifting Data Streams

Model Averaging via Penalized Regression for Tracking Concept Drift

Adaptive concept drift detection

Adaptive concept drift detection

Ambiguous decision trees for mining concept-drifting data streams

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Concept Drifting Data Streams Research Articles

Related Topics

Articles published on Concept Drifting Data Streams

Model Averaging via Penalized Regression for Tracking Concept Drift

Adaptive concept drift detection

Adaptive concept drift detection

Ambiguous decision trees for mining concept-drifting data streams