Scalable teacher forcing network for semi-supervised large scale data streams

Mahardhika Pratama,Choiru Za’In,Edwin Lughofer,Eric Pardede,Dwi A.P Rahayu

doi:10.1016/j.ins.2021.06.075

Abstract

The large-scale data stream problem refers to high-speed information flow which cannot be processed in scalable manner under a traditional computing platform. This problem also imposes expensive labelling cost making the deployment of fully supervised algorithms unfeasible. On the other hand, the problem of semi-supervised large-scale data streams is little explored in the literature because most works are designed in the traditional single-node computing environments while also being fully supervised approaches. This paper offers Weakly Supervised Scalable Teacher Forcing Network (WeScatterNet) to cope with the scarcity of labelled samples and the large-scale data streams simultaneously. WeScatterNet is crafted under distributed computing platform of Apache Spark with a data-free model fusion strategy for model compression after parallel computing stage. It features an open network structure to address the global and local drift problems while integrating a data augmentation, annotation and auto-correction (DA3) method for handling partially labelled data streams. The performance of WeScatterNet is numerically evaluated in the six large-scale data stream problems with only 25% label proportions. It shows highly competitive performance even if compared with fully supervised learners with 100% label proportions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Scalable teacher forcing network for semi-supervised large scale data streams

Abstract

Talk to us

Similar Papers

More From: Information Sciences

Lead the way for us

Journal: Information Sciences	Publication Date: Jun 26, 2021
Citations: 13

Similar Papers

Balanced Parallel Frequent Pattern Mining over Massive Data Stream
Xi Fu ... Lei Shi
-
Xi Fu, et. al.Xi Fu ... Lei Shi
01 Apr 2017
01 Apr 2017

Dynamic pattern matching with multiple queries on large scale data streams
S Sukhanov ... A.M Zoubir
Signal Processing | VOL. 171
S Sukhanov, et. al.S Sukhanov ... A.M Zoubir
26 Nov 2019
Signal Processing | VOL. 171

Large-Scale Data Stream Processing Systems
Paris Carbone ... Juan Soto
-
Paris Carbone, et. al.Paris Carbone ... Juan Soto
01 Jan 2017
01 Jan 2017

Correlated Anomaly Detection from Large Streaming Data
Zheng Chen ... Bo Song
-
Zheng Chen, et. al.Zheng Chen ... Bo Song
01 Dec 2018
01 Dec 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scalable teacher forcing network for semi-supervised large scale data streams

Abstract

Talk to us

Similar Papers

More From: Information Sciences