A local algorithm to approximate the global clustering of streams generated in ubiquitous sensor networks

Pedro Pereira Rodrigues, Luís Lopes, João Gama, João Araújo

doi:10.1177/1550147718808239

Open Access

https://doi.org/10.1177/1550147718808239

Abstract

In ubiquitous streaming data sources, such as sensor networks, clustering nodes by the data they produce gives insights on the phenomenon being monitored. However, centralized algorithms force communication and storage requirements to grow unbounded. This article presents L2GClust, an algorithm to compute local clusterings at each node as an approximation of the global clustering. L2GClust performs local clustering of the sources based on the moving average of each node’s data over time: the moving average is approximated using memory-less statistics; clustering is based on the furthest-point algorithm applied to the centroids computed by the node’s direct neighbors. Evaluation is performed both on synthetic and real sensor data, using a state-of-the-art sensor network simulator and measuring sensitivity to network size, number of clusters, cluster overlapping, and communication incompleteness. A high level of agreement was found between local and global clusterings, with special emphasis on separability agreement, while an overall robustness to incomplete communications emerged. Communication reduction was also theoretically shown, with communication ratios empirically evaluated for large networks. L2GClust is able to keep a good approximation of the global clustering, using less communication than a centralized alternative, supporting the recommendation to use local algorithms for distributed clustering of streaming data sources.

Highlights

Information is now generated and gathered from distributed data sources at a very high rate, stressing communications and computing infrastructure.
Clustering streaming data sources is the task of clustering different sources of data streams based on the similarity of their data series.[1]
Algorithms aim to find groups of data sources that behave similarly through time, where similarity is usually measured as the distance between the data series or between their data distributions.

Summary

Introduction

Information is generated and gathered from distributed data sources at a very high rate, stressing communications and computing infrastructure. The moving average of each node is approximated using a memoryless fading average, while clustering is based on the furthest-point algorithm applied to the centroids computed by the node’s direct neighbors. This way, each sensor acts both as a data stream source and as a processing node, keeping a sketch of its own data and a definition of the clustering structure of the entire network of data sources. The idea behind this step is to aggregate all the locally defined centers and apply a clustering procedure to them, treating the centers themselves as the points to be clustered. As a result, each time this sensor uses or transmits its estimate Cx(i) of the global clustering structure, the estimate already reflects its most recent sketch and its neighbors’ information.
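The two ingredients described above can be illustrated with a minimal Python sketch. It is not the authors' L2GClust implementation. It assumes one common formulation of the fading average (a decayed sum divided by a decayed count, with a hypothetical fading factor `alpha`) and the classic greedy furthest-point heuristic. The names `FadingAverage`, `furthest_point_centers`, and `local_clustering`, and the choice of the node's own sketch as the first center, are illustrative assumptions.

```python
from math import dist  # Euclidean distance between two points (Python 3.8+)


class FadingAverage:
    """Memoryless fading average: only two running numbers are stored."""

    def __init__(self, alpha=0.9):
        # alpha is an assumed fading factor; older readings decay by alpha per step.
        if not 0.0 < alpha < 1.0:
            raise ValueError("alpha must lie strictly between 0 and 1")
        self.alpha = alpha
        self.weighted_sum = 0.0
        self.weighted_count = 0.0

    def update(self, x):
        # Decay the previous totals, add the new reading, return the current estimate.
        self.weighted_sum = x + self.alpha * self.weighted_sum
        self.weighted_count = 1.0 + self.alpha * self.weighted_count
        return self.weighted_sum / self.weighted_count


def furthest_point_centers(points, k):
    """Greedy furthest-point heuristic: pick up to k mutually distant centers."""
    if not points or k < 1:
        return []
    centers = [points[0]]  # assumption: the first point seeds the procedure
    while len(centers) < k:
        # Candidate whose nearest chosen center is furthest away.
        candidate = max(points, key=lambda p: min(dist(p, c) for c in centers))
        if min(dist(candidate, c) for c in centers) == 0.0:
            break  # every remaining point coincides with a chosen center
        centers.append(candidate)
    return centers


def local_clustering(own_sketch, neighbor_centers, k):
    """Aggregate the node's own sketch with its neighbors' centers and re-cluster."""
    candidates = [own_sketch]
    for centers in neighbor_centers:
        candidates.extend(centers)
    return furthest_point_centers(candidates, k)


if __name__ == "__main__":
    sketch = FadingAverage(alpha=0.5)
    for reading in (2.0, 4.0):
        estimate = sketch.update(reading)
    # Centers received from two hypothetical direct neighbors (1-D points).
    received = [[(0.1,), (10.0,)], [(9.9,), (0.2,)]]
    print(local_clustering((estimate,), received, k=2))
```

In this sketch each node stores only two numbers for its own stream, and the clustering step works on a handful of centers rather than on raw data, which is the property that keeps memory and communication bounded in a local approach.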

Evaluation methodology

Evaluation results

Limitations and future work

Findings and recommendations

Journal: International Journal of Distributed Sensor Networks
Publication Date: Oct 1, 2018
Citations: 3
License type: cc-by
