Developing an efficient spectral clustering algorithm on large scale graphs in spark

Ahmed I Taloba,Taysir Hassan A Soliman,Marwan R Riad

doi:10.1109/intelcis.2017.8260077

Abstract

Recently, most of the data can be represented by graph structures, such as social media, Protein-Protein Interaction, transportation system, systems biology,…, etc. Many researches have been achieved to cluster very large graphs but more efficient algorithms are required since such a process takes a long time and requires more memory. In this paper, we propose an Efficient Spectral Clustering Algorithm on Large Scale Graphs in Spark (ESCALG), using map reduce function and shuffling phases in Dijkstra's algorithm. In addition, ESCALG depends mainly on a sparse matrix as a data structure, which less time in execution. Then, GraphX is applied to deal with graph data processing and in GraphX used Pregel in computing shortest path. To test the performance of ESCALG, it is compared with Large-Scale Spectral Clustering on Graphs and Standard Spectral Clustering Algorithms using seven datasets, where ESCALG proved high efciency in terms of memory and time performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Developing an efficient spectral clustering algorithm on large scale graphs in spark

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Regularized spectral clustering under the mixed membership stochastic block model
Huan Qing ... Jingli Wang
Neurocomputing | VOL. 550
Huan Qing, et. al.Huan Qing ... Jingli Wang
26 Jun 2023
Neurocomputing | VOL. 550

Efficient implementation of scatter-gather operations for large scale graph analytics
Manoj Kumar ... Pratap Pattnaik
-
Manoj Kumar, et. al.Manoj Kumar ... Pratap Pattnaik
01 Sep 2016
01 Sep 2016

Designing an efficient parallel spectral clustering algorithm on multi-core processors in Julia
Zenan Huo ... Fabio Giampaolo
Journal of Parallel and Distributed Computing | VOL. 138
Zenan Huo, et. al.Zenan Huo ... Fabio Giampaolo
20 Jan 2020
Journal of Parallel and Distributed Computing | VOL. 138

Large scale graph processing systems: survey and an experimental evaluation
Omar Batarfi ... Sherif Sakr
Cluster Computing | VOL. 18
Omar Batarfi, et. al.Omar Batarfi ... Sherif Sakr
24 Jul 2015
Cluster Computing | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Developing an efficient spectral clustering algorithm on large scale graphs in spark

Abstract

Talk to us

Similar Papers