From Louvain to Leiden: guaranteeing well-connected communities

V A Traag, N J Van Eck, L Waltman

doi:10.1038/s41598-019-41695-z

Abstract

Community detection is often used to understand the structure of large and complex networks. One of the most popular algorithms for uncovering community structure is the so-called Louvain algorithm. We show that this algorithm has a major defect that largely went unnoticed until now: the Louvain algorithm may yield arbitrarily badly connected communities. In the worst case, communities may even be disconnected, especially when running the algorithm iteratively. In our experimental analysis, we observe that up to 25% of the communities are badly connected and up to 16% are disconnected. To address this problem, we introduce the Leiden algorithm. We prove that the Leiden algorithm yields communities that are guaranteed to be connected. In addition, we prove that, when the Leiden algorithm is applied iteratively, it converges to a partition in which all subsets of all communities are locally optimally assigned. Furthermore, by relying on a fast local move approach, the Leiden algorithm runs faster than the Louvain algorithm. We demonstrate the performance of the Leiden algorithm for several benchmark and real-world networks. We find that the Leiden algorithm is faster than the Louvain algorithm and uncovers better partitions, in addition to providing explicit guarantees.

Highlights

In many complex networks, nodes cluster and form relatively dense groups, often called communities [1, 2]. Such a modular structure is usually not known beforehand.
We show that the Louvain algorithm has a major problem, for both modularity and the Constant Potts Model (CPM): it may yield badly connected, or even disconnected, communities.
We find that the Leiden algorithm is faster than the Louvain algorithm, owing to its fast local move approach.
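
The worst case mentioned in the abstract, a disconnected community, can be detected with a simple graph traversal. The following is a minimal sketch, not the authors' code: the function name, the edge-list input and the `membership` dictionary (node to community label) are assumptions made for illustration. It only flags fully disconnected communities and does not measure how badly connected a community is.

```python
from collections import defaultdict, deque

def disconnected_communities(edges, membership):
    """Return labels of communities whose induced subgraph is not connected.

    edges: iterable of (u, v) pairs of an undirected graph (assumed input format).
    membership: dict mapping each node to its community label (assumed).
    """
    # Keep only edges whose endpoints share a community.
    adj = defaultdict(set)
    for u, v in edges:
        if membership[u] == membership[v]:
            adj[u].add(v)
            adj[v].add(u)

    # Group nodes by community label.
    members = defaultdict(list)
    for node, label in membership.items():
        members[label].append(node)

    bad = []
    for label, nodes in members.items():
        # Breadth-first search restricted to intra-community edges.
        seen = {nodes[0]}
        queue = deque([nodes[0]])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in seen:
                    seen.add(v)
                    queue.append(v)
        if len(seen) < len(nodes):
            bad.append(label)
    return bad
```

For example, on the path 0-1-2 with nodes 0 and 2 in one community and node 1 in another, the first community has no internal edge joining its two nodes, so it is reported as disconnected.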

Summary

INTRODUCTION

In many complex networks, nodes cluster and form relatively dense groups, often called communities [1, 2]. Such a modular structure is usually not known beforehand. One of the best-known methods for community detection is modularity [3]. This method tries to maximise the difference between the actual number of edges in a community and the expected number of such edges. The expected number of edges in a community c depends on K_c, the sum of the degrees of the nodes in community c, and on m, the total number of edges in the network. This way of defining the expected number of edges is based on the so-called configuration model. We show that the Louvain algorithm has a major problem, for both modularity and the Constant Potts Model (CPM): it may yield badly connected, or even disconnected, communities. To address this problem, we introduce a new algorithm, which we name the Leiden algorithm, after the location of its authors.
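
To make the two quality functions concrete, here is a small sketch for undirected, unweighted graphs. It is not taken from the paper's implementation. The function names, the edge-list input and the `membership` dictionary are assumptions, and the modularity normalisation used here is the common textbook one, which may differ from the paper's form by a constant factor. Both functions take a resolution parameter `gamma`.

```python
from collections import defaultdict

def modularity(edges, membership, gamma=1.0):
    """Sum over communities of e_c / m - gamma * (K_c / (2m))^2.

    e_c is the number of edges inside community c, K_c the sum of the
    degrees of its nodes, and m the total number of edges.
    """
    m = len(edges)
    e = defaultdict(int)
    K = defaultdict(int)
    for u, v in edges:
        cu, cv = membership[u], membership[v]
        K[cu] += 1  # each edge adds one to the degree of both endpoints
        K[cv] += 1
        if cu == cv:
            e[cu] += 1
    return sum(e[c] / m - gamma * (K[c] / (2 * m)) ** 2 for c in K)

def cpm(edges, membership, gamma):
    """Constant Potts Model: sum over communities of e_c - gamma * n_c * (n_c - 1) / 2,
    where n_c is the number of nodes in community c."""
    e = defaultdict(int)
    n = defaultdict(int)
    for label in membership.values():
        n[label] += 1
    for u, v in edges:
        if membership[u] == membership[v]:
            e[membership[u]] += 1
    return sum(e[c] - gamma * n[c] * (n[c] - 1) / 2 for c in n)

# Two triangles joined by a single edge, split into their natural communities.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
membership = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
print(modularity(edges, membership))
print(cpm(edges, membership, gamma=0.5))
```

Placing all six nodes in one community gives a modularity of zero, since the actual and expected edge counts then coincide.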

LOUVAIN ALGORITHM

Badly connected communities

LEIDEN ALGORITHM

Guarantees

EXPERIMENTAL ANALYSIS

Benchmark networks

Empirical networks

DISCUSSION

Non-decreasing move sequences

Greedy move sequences

Guarantees in each iteration

Guarantees in stable iterations

Findings

Asymptotic guarantees

Journal: Scientific Reports
Publication Date: Mar 26, 2019
Citations: 2969
License type: open-access
