Enhancing Robustness of On-line Learning Models on Highly Noisy Data

Zilong Zhao,Rui Han,Sonia Ben Mokhtar,Lydia Y Chen,Sara Bouchenak,Robert Birke,Bogdan Robu

doi:10.1109/tdsc.2021.3063947

Abstract

Classification algorithms have been widely adopted to detect anomalies for various systems, e.g., IoT, cloud and face recognition, under the common assumption that the data source is clean, i.e., features and labels are correctly set. However, data collected from the wild can be unreliable due to careless annotations or malicious data transformation for incorrect anomaly detection. In this paper, we extend a two-layer on-line data selection framework: Robust Anomaly Detector (RAD) with a newly designed ensemble prediction where both layers contribute to the final anomaly detection decision. To adapt to the on-line nature of anomaly detection, we consider additional features of conflicting opinions of classifiers, repetitive cleaning, and oracle knowledge. We on-line learn from incoming data streams and continuously cleanse the data, so as to adapt to the increasing learning capacity from the larger accumulated data set. Moreover, we explore the concept of oracle learning that provides additional information of true labels for difficult data points. We specifically focus on three use cases, (i) detecting 10 classes of IoT attacks, (ii) predicting 4 classes of task failures of big data jobs, and (iii) recognising 100 celebrities faces. Our evaluation results show that RAD can robustly improve the accuracy of anomaly detection, to reach up to 98.95% for IoT device attacks (i.e., +7%), up to 85.03% for cloud task failures (i.e., +14%) under 40% label noise, and for its extension, it can reach up to 77.51% for face recognition (i.e., +39%) under 30% label noise. The proposed RAD and its extensions are general and can be applied to different anomaly detection algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing Robustness of On-line Learning Models on Highly Noisy Data

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing

Lead the way for us

Journal: IEEE Transactions on Dependable and Secure Computing	Publication Date: Jan 1, 2021
Citations: 2

Similar Papers

Robust Anomaly Detection on Unreliable Data
Zilong Zhao ... Bogdan Robu
-
Zilong Zhao, et. al.Zilong Zhao ... Bogdan Robu
01 Jun 2019
01 Jun 2019

Combining Outlier Detection and Reconstruction Error Minimization for Label Noise Reduction
Weining Zhang ... Xiaoyang Tan
-
Weining Zhang, et. al.Weining Zhang ... Xiaoyang Tan
01 Feb 2019
01 Feb 2019

MILAD: Robust Anomaly Detection for Electric Vehicles with Label Noise
Yu Ye ... Bailin Feng
Journal of Physics: Conference Series | VOL. 2132
Yu Ye, et. al.Yu Ye ... Bailin Feng
01 Dec 2021
Journal of Physics: Conference Series | VOL. 2132

Self-paced Robust Deep Face Recognition with Label Noise
Pengfei Zhu ... Wenya Ma
-
Pengfei Zhu, et. al.Pengfei Zhu ... Wenya Ma
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing Robustness of On-line Learning Models on Highly Noisy Data

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Dependable and Secure Computing