Fast Distance-based Outlier Detection in Data Streams based on Micro-clusters

Luan Tran,Liyue Fan,Cyrus Shahabi

doi:10.1145/3368926.3369667

Abstract

Continuous outlier detection in data streams is one important topic in data mining. It has many applications in public health, network intrusion detection, and fraud detection. Over the last two decades of research, many studies have been conducted on distance-based outlier detection algorithms which are viable, scalable, and parameter-free approaches. Because streaming data points arrive and expire over time, the challenge is to monitor the outlier status of data points with time and space efficiency. In this study, we propose three algorithms: O-MCOD, U-MCOD, and M-MCOD. These algorithms improve upon the state-of-the-art algorithm in distance-based outlier detection in data streams, i.e., MCOD, by relaxing the constraints of micro-clusters and using the minimal probing principal. With extensive experiments on synthetic and real-world datasets, we show that the proposed algorithms are superior in time and space efficiency. Specially, our proposed algorithms are 1.5 to 95 times faster than MCOD, require as low as 25% peak memory compared to MCOD, and outperform the most recent algorithm NETS.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fast Distance-based Outlier Detection in Data Streams based on Micro-clusters

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Outlier Detection in Non-stationary Data Streams
Luan Tran ... Cyrus Shahabi
-
Luan Tran, et. al.Luan Tran ... Cyrus Shahabi
23 Jul 2019
23 Jul 2019

Distance-based outlier detection in data streams
Luan Tran ... Liyue Fan
Proceedings of the VLDB Endowment | VOL. 9
Luan Tran, et. al.Luan Tran ... Liyue Fan
01 Aug 2016
Proceedings of the VLDB Endowment | VOL. 9

An Effective Minimal Probing Approach With Micro-Cluster for Distance-Based Outlier Detection in Data Streams
Mohamed Jaward Bah ... Hanan Aljuaid
IEEE Access | VOL. 7
Mohamed Jaward Bah, et. al.Mohamed Jaward Bah ... Hanan Aljuaid
01 Jan 2019
IEEE Access | VOL. 7

A Fast and Efficient Local Outlier Detection in Data Streams
Xing Yang ... Wenli Zhou
-
Xing Yang, et. al.Xing Yang ... Wenli Zhou
25 Feb 2019
25 Feb 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast Distance-based Outlier Detection in Data Streams based on Micro-clusters

Abstract

Talk to us

Similar Papers