Graph-based data clustering via multiscale community detection

Zijing Liu,Mauricio Barahona

doi:10.1007/s41109-019-0248-7

Abstract

We present a graph-theoretical approach to data clustering, which combines the creation of a graph from the data with Markov Stability, a multiscale community detection framework. We show how the multiscale capabilities of the method allow the estimation of the number of clusters, as well as alleviating the sensitivity to the parameters in graph construction. We use both synthetic and benchmark real datasets to compare and evaluate several graph construction methods and clustering algorithms, and show that multiscale graph-based clustering achieves improved performance compared to popular clustering methods without the need to set externally the number of clusters.

Highlights

Clustering is a classic task in data mining, whereby input data are organised into groups such that data points within a group are more similar to each other than to those outside the group (Xu and Wunsch 2005)
We evaluate several geometric graph constructions, from methods that use only local distances to others that balance local and global measures, and find that the recently proposed Continuous k-nearest neighbours (CkNN) graph (Berry and Sauer 2019) performs well for graph-based data clustering via community detection
We have investigated the use of multiscale community detection for graph-based data clustering

Summary

Introduction

Clustering is a classic task in data mining, whereby input data are organised into groups (or clusters) such that data points within a group are more similar to each other than to those outside the group (Xu and Wunsch 2005). We evaluate several geometric graph constructions, from methods that use only local distances to others that balance local and global measures, and find that the recently proposed Continuous k-nearest neighbours (CkNN) graph (Berry and Sauer 2019) performs well for graph-based data clustering via community detection.

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Network Science	Publication Date: Jan 8, 2020
Citations: 40	License type: open-access

R Discovery Prime

R Discovery Prime

Graph-based data clustering via multiscale community detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Network Science

Lead the way for us

Similar Papers

Geometric multiscale community detection: Markov stability and vector partitioning
Zijing Liu ... Mauricio Barahona
Journal of Complex Networks | VOL. 6
Zijing Liu, et. al.Zijing Liu ... Mauricio Barahona
26 Jul 2017
Journal of Complex Networks | VOL. 6

Improved Self-Adaptive ACS Algorithm to Determine the Optimal Number of Clusters
Ayad Mohammed Jabbar ... Ku Ruhana Ku-Mahamud
International Journal on Advanced Science, Engineering and Information Technology | VOL. 11
Ayad Mohammed Jabbar, et. al.Ayad Mohammed Jabbar ... Ku Ruhana Ku-Mahamud
21 Jun 2021
International Journal on Advanced Science, Engineering and Information Technology | VOL. 11

A More Relaxed Model for Graph-Based Data Clustering: s-Plex Cluster Editing
Jiong Guo ... Johannes Uhlmann
SIAM Journal on Discrete Mathematics | VOL. 24
Jiong Guo, et. al.Jiong Guo ... Johannes Uhlmann
01 Jan 2009
SIAM Journal on Discrete Mathematics | VOL. 24

Advanced Cost based Graph Clustering Algorithm for Random Geometric Graphs
K K Shukla ... Mousumi Dhara
International Journal of Computer Applications | VOL. 60
K K Shukla, et. al.K K Shukla ... Mousumi Dhara
18 Dec 2012
International Journal of Computer Applications | VOL. 60

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Graph-based data clustering via multiscale community detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Network Science