Fast cluster-based computation of exact betweenness centrality in large graphs

Cecile Daniel,Angelo Furno,Eugenio Zimeo,Lorenzo Goglia

doi:10.1186/s40537-021-00483-1

Abstract

Nowadays a large amount of data is originated by complex systems, such as social networks, transportation systems, computer and service networks. These systems can be modeled by using graphs and studied by exploiting graph metrics, such as betweenness centrality (BC), a popular metric to analyze node centrality of graphs. In spite of its great potential, this metric requires long computation time, especially for large graphs. In this paper, we present a very fast algorithm to compute BC of undirected graphs by exploiting clustering. The algorithm leverages structural properties of graphs to find classes of equivalent nodes: by selecting one representative node for each class, we are able to compute BC by significantly reducing the number of single-source shortest path explorations adopted by Brandes’ algorithm. We formally prove the graph properties that we exploit to define the algorithm and present an implementation based on Scala for both sequential and parallel map-reduce executions. The experimental evaluation of both versions, conducted with synthetic and real graphs, reveals that our solution largely outperforms Brandes’ algorithm and significantly improves known heuristics.

Highlights

The massive amount of data available today in many domains is often originated by complex systems that can be modeled as graphs where network centrality is exploited for identifying important nodes of the modeled systems
We focus on unweighted graphs while its extension to weighted ones can be obtained by substituting the breadth-first search (BFS) with Dijkstra algorithm
Bridges and articulation vertices are edges and nodes, respectively, whose removal from a graph leads to a new graph with a greater number of connected components; degree-1 vertices are leaf nodes which, considered as source and targets, contribute to the computation of betweenness centrality (BC) of crossed nodes; identical vertices are the ones characterized by the same neighbors and, by the same BC values; side vertices are nodes such that the graphs induced by their neighbors are cliques and they are not crossed by shortest paths

Summary

Introduction

The massive amount of data available today in many domains is often originated by complex systems that can be modeled as graphs (e.g., social networks, transportation networks, computer networks, service networks, etc.) where network centrality is exploited for identifying important nodes (or edges) of the modeled systems. Bridges and articulation vertices are edges and nodes, respectively, whose removal from a graph leads to a new graph with a greater number of connected components; degree-1 vertices are leaf nodes which, considered as source and targets, contribute to the computation of BC of crossed nodes; identical vertices are the ones characterized by the same neighbors and, by the same BC values; side vertices are nodes such that the graphs induced by their neighbors are cliques and they are not crossed by shortest paths By using all these techniques, the authors achieve significant speedup with different kinds of graphs. Incremental and approximated computations are approaches for specific classes of applications that regard slowly changing graphs or rank-based exploitation of BC, respectively, which we consider out of the scope of this paper

Background

This is the part of contribution due to w as a destination

Findings

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Big Data	Publication Date: Jun 26, 2021
Citations: 6	License type: open-access

R Discovery Prime

R Discovery Prime

Fast cluster-based computation of exact betweenness centrality in large graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Big Data

Lead the way for us

Similar Papers

Cluster-based Computation of Exact Betweenness Centrality in Large Undirected Graphs
Cecile Daniel ... Angelo Furno
-
Cecile Daniel, et. al.Cecile Daniel ... Angelo Furno
01 Dec 2019
01 Dec 2019

I/O-efficient calculation of H-group closeness centrality over disk-resident graphs
Junzhou Zhao ... Xiaohong Guan
Information Sciences | VOL. 400-401
Junzhou Zhao, et. al.Junzhou Zhao ... Xiaohong Guan
12 Mar 2017
Information Sciences | VOL. 400-401

Improved Social Network Analysis Method in SNS
...
-
, et. al. ...
01 Jan 2012
01 Jan 2012

Disconnection of network hubs and cognitive impairment after traumatic brain injury.
Erik D Fagerholm ... Peter J Hellyer
Brain | VOL. 138
Erik D Fagerholm, et. al.Erik D Fagerholm ... Peter J Hellyer
25 Mar 2015
Brain | VOL. 138

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast cluster-based computation of exact betweenness centrality in large graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Big Data