Density-based clustering of big probabilistic graphs

Zahid Halim, Jamal Hussain Khattak

doi:10.1007/s12530-018-9223-2

Abstract

Clustering is a machine learning task that groups similar objects into coherent sets. Objects within a cluster exhibit similar behavior. With the exponential increase in data volume, robust approaches are required to process data and extract clusters. In addition to their large volume, datasets may contain uncertainties due to the heterogeneity of the data sources, which together characterize Big Data. Modern approaches and algorithms in machine learning widely use probability theory to quantify data uncertainty. Such huge uncertain data can be transformed into a probabilistic graph-based representation. This work presents an approach for density-based clustering of big probabilistic graphs. The approach clusters large probabilistic graphs using the graph’s density, where the clustering process is guided by the nodes’ degree and the neighborhood information. It is evaluated using seven real-world benchmark datasets, namely protein-to-protein interaction, yahoo, movie-lens, core, last.fm, delicious social bookmarking system, and epinions. These datasets are first transformed to a graph-based representation before the proposed clustering algorithm is applied. The obtained results are evaluated using three cluster validation indices, namely the Davies–Bouldin index, the Dunn index, and the Silhouette coefficient. The proposal is also compared with four state-of-the-art approaches for clustering large probabilistic graphs. The results obtained on the seven datasets and three cluster validity indices suggest better performance of the proposed approach.
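To make the idea of degree- and neighborhood-guided clustering concrete, the following is a minimal sketch and not the authors' algorithm, whose details the abstract does not give. It assumes a probabilistic graph is supplied as `(u, v, p)` triples, where `p` is the edge existence probability, and it substitutes a simple DBSCAN-style expansion: nodes are visited in order of decreasing expected degree, and clusters grow only along sufficiently probable edges. The function names and the `min_prob` and `min_degree` parameters are hypothetical and chosen for illustration only.

```python
from collections import defaultdict


def expected_degrees(edges):
    """Expected degree of a node: sum of the probabilities of its incident edges."""
    deg = defaultdict(float)
    for u, v, p in edges:
        deg[u] += p
        deg[v] += p
    return dict(deg)


def density_cluster(edges, min_prob=0.5, min_degree=1.0):
    """Illustrative DBSCAN-style sketch (assumed thresholds, not from the paper).

    A node is treated as dense if its expected degree is at least min_degree.
    Dense nodes seed and expand clusters along edges with probability
    >= min_prob. Nodes that are never reached get the label -1.
    """
    adj = defaultdict(dict)
    for u, v, p in edges:
        adj[u][v] = p
        adj[v][u] = p
    deg = expected_degrees(edges)

    labels = {}
    next_label = 0
    # Visit candidate seeds from highest to lowest expected degree.
    for seed in sorted(deg, key=lambda n: -deg[n]):
        if seed in labels or deg[seed] < min_degree:
            continue
        labels[seed] = next_label
        frontier = [seed]
        while frontier:
            node = frontier.pop()
            if deg[node] < min_degree:
                continue  # border node: joins the cluster but does not expand it
            for nbr, p in adj[node].items():
                if nbr not in labels and p >= min_prob:
                    labels[nbr] = next_label
                    frontier.append(nbr)
        next_label += 1

    # Anything left unlabeled is considered unclustered.
    for node in deg:
        labels.setdefault(node, -1)
    return labels


if __name__ == "__main__":
    # Toy graph: two tightly linked triangles joined by one unlikely edge.
    toy_edges = [
        ("a", "b", 0.9), ("b", "c", 0.8), ("a", "c", 0.7),
        ("c", "d", 0.1),
        ("d", "e", 0.9), ("e", "f", 0.85), ("d", "f", 0.6),
    ]
    print(density_cluster(toy_edges))
```

On the toy graph, the low-probability edge between `c` and `d` is not followed, so the two triangles end up in separate clusters. A real implementation for big graphs would also need a scalable graph representation and a principled choice of thresholds, neither of which this sketch addresses.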

Journal: Evolving Systems
Publication Date: Mar 22, 2018
Citations: 20

Similar Papers

Efficient clustering of large uncertain graphs using neighborhood information
Zahid Halim ... Ahmar Rashid
International Journal of Approximate Reasoning | VOL. 90
04 Aug 2017

Ensemble-based clustering of large probabilistic graphs using neighborhood and distance metric learning
Malihe Danesh ... Farzin Yaghmaee
The Journal of Supercomputing | VOL. 77
14 Sep 2020

Clustering of graphs using pseudo-guided random walk
Zahid Halim ... Muhammad Waqas
Journal of Computational Science | VOL. 51
23 Jan 2021

Clustering large probabilistic graphs using multi-population evolutionary algorithm
Zahid Halim ... Syed Fawad Hussain
Information Sciences | VOL. 317
24 Apr 2015
