Hierarchical Reduced-space Drift Detection Framework for Multivariate Supervised Data Streams

Shuyi Zhang, Peter Tino, Xin Yao

doi:10.1109/tkde.2021.3111756

Abstract

In a streaming environment, the characteristics of the data themselves and their relationship with the labels are likely to experience changes as time goes on. Most drift detection methods for supervised data streams are performance-based, that is, they detect changes only after the classification accuracy deteriorates. This may not be sufficient in many application areas where the reason behind a drift is also important. Another category of drift detectors are data distribution-based detectors. Although they can detect some drifts within the input space, changes affecting only the labelling mechanism cannot be identified. Furthermore, little work is available on drift detection for high-dimensional supervised data streams. In this paper we propose an advanced Hierarchical Reduced-space Drift Detection Framework for Supervised Data Streams (HRDS) which captures drifts regardless of their effects on classification performance. This framework suggests monitoring both marginal and class-conditional distributions within a lower-dimensional space specifically relevant to the assigned classification task. Experimental comparisons have demonstrated that the proposed HRDS not only achieves high-quality performance on high-dimensional data streams, but also outperforms its competitors in terms of detection recall, precision and F-measure across a wide range of different concept drift types including subtle drifts.

Highlights

In real-world applications such as weather prediction, industrial quality control and fraud detection, data often arrives in the form of a stream.
A hierarchical change detection test (HCDT) has been shown to achieve a more advantageous false positive rate (FPR) versus detection delay (DD) trade-off than its single change detection test (CDT) counterpart, but it has only been tested on non-labelled scalar data [21].
We provide one possible realization for a binary classification problem as an illustrative example in this paper. It is worth noting that the general framework of Hierarchical Reduced-space Drift Detection (HRDD) is suitable for multi-class data streams.

Summary

INTRODUCTION

In real-world applications such as weather prediction, industrial quality control and fraud detection, data often arrives in the form of a stream. HCDT has been shown to achieve a more advantageous false positive rate (FPR) versus detection delay (DD) trade-off than its single CDT counterpart, but it has only been tested on non-labelled scalar data [21]. Direct application of this framework to multivariate supervised data streams still suffers from the aforementioned deficiencies of distribution-based detectors. The contributions of our work include: 1) A new hierarchical detection framework proposed for supervised data streams that detects both real and virtual drifts. We provide one possible realization for a binary classification problem as an illustrative example in this paper. It is worth noting that the general framework of HRDD is suitable for multi-class data streams.
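The idea of monitoring both the marginal and the class-conditional distributions in a reduced space can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a two-layer arrangement in which a cheap test raises alarms and a stricter test confirms them, and the projection onto the difference of class means, the z-test, the permutation test, and all function names and thresholds are hypothetical stand-ins chosen for brevity.

```python
import random
import statistics


def project(x, w):
    """Project a feature vector onto direction w (dot product)."""
    return sum(xi * wi for xi, wi in zip(x, w))


def class_mean_direction(X, y):
    """Hypothetical 1-D subspace: difference of the two class means."""
    dim = len(X[0])
    means = {}
    for c in (0, 1):
        rows = [x for x, label in zip(X, y) if label == c]
        means[c] = [sum(r[j] for r in rows) / len(rows) for j in range(dim)]
    return [a - b for a, b in zip(means[1], means[0])]


def layer1_alarm(ref, win, z_thresh=3.0):
    """Assumed first layer: z-test of the window mean against the reference."""
    se = statistics.stdev(ref) * (1 / len(ref) + 1 / len(win)) ** 0.5
    return abs(statistics.fmean(win) - statistics.fmean(ref)) > z_thresh * se


def layer2_confirm(ref, win, n_perm=500, alpha=0.01, seed=0):
    """Assumed second layer: permutation test on the mean difference."""
    rng = random.Random(seed)
    observed = abs(statistics.fmean(win) - statistics.fmean(ref))
    pooled = list(ref) + list(win)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        a, b = pooled[:len(ref)], pooled[len(ref):]
        if abs(statistics.fmean(b) - statistics.fmean(a)) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1) < alpha


def detect(ref_X, ref_y, win_X, win_y):
    """Return the names of the monitored distributions with confirmed drift."""
    w = class_mean_direction(ref_X, ref_y)
    ref_z = [project(x, w) for x in ref_X]
    win_z = [project(x, w) for x in win_X]
    # Monitor the marginal distribution and one distribution per class.
    streams = {"marginal": (ref_z, win_z)}
    for c in (0, 1):
        streams[f"class_{c}"] = (
            [z for z, label in zip(ref_z, ref_y) if label == c],
            [z for z, label in zip(win_z, win_y) if label == c],
        )
    drifted = []
    for name, (r, s) in streams.items():
        # The second layer runs only when the first layer has raised an alarm.
        if len(r) > 1 and len(s) > 1 and layer1_alarm(r, s) and layer2_confirm(r, s):
            drifted.append(name)
    return drifted
```

Calling `detect` with a window whose inputs match the reference but whose labels are swapped leaves the marginal stream untouched and flags only the class-conditional streams, which is the kind of labelling-only change that a purely input-space detector would miss.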

Learning of a lower-dimensional subspace

Class-based detection

Knowledge base reconfiguration

Performance metrics
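The abstract reports detection recall, precision and F-measure, and the highlights mention detection delay. One common way to compute such scores from a list of known drift positions and a list of raised alarms is sketched below. The matching rule (each true drift is paired with the first unused alarm within a fixed tolerance after it), the `tolerance` parameter and the function names are assumptions made for this illustration and are not taken from the paper.

```python
def match_detections(true_drifts, detections, tolerance):
    """Pair each true drift with the first unused alarm in [t, t + tolerance].

    Returns (matches, false_alarms), where matches is a list of
    (true_position, detection_position) pairs.
    """
    detections = sorted(detections)
    used = set()
    matches = []
    for t in sorted(true_drifts):
        for i, d in enumerate(detections):
            if i not in used and t <= d <= t + tolerance:
                used.add(i)
                matches.append((t, d))
                break
    false_alarms = [d for i, d in enumerate(detections) if i not in used]
    return matches, false_alarms


def detection_scores(true_drifts, detections, tolerance):
    """Compute precision, recall, F-measure and mean delay of a detector."""
    matches, false_alarms = match_detections(true_drifts, detections, tolerance)
    tp = len(matches)            # drifts that were detected in time
    fp = len(false_alarms)       # alarms not attributable to any drift
    fn = len(true_drifts) - tp   # drifts that were missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    mean_delay = sum(d - t for t, d in matches) / tp if tp else None
    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "mean_delay": mean_delay,
    }
```

For example, with true drifts at steps 1000, 2000 and 3000, alarms at 1040, 1500, 2100 and 3500, and a tolerance of 200 steps, two drifts are matched, two alarms count as false, and one drift is missed.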

CONCLUSION


Journal: IEEE Transactions on Knowledge and Data Engineering
Publication Date: Jan 1, 2021
Citations: 1
License type: CC BY 4.0


