Abstract
Abstract The grouping of nodes into subsets that are relatively densely interconnected and separable from the rest of the network is a property often displayed in many complex real-world networks; this feature is known as a community structure. There is a growing demand for algorithms that can find partitions that resemble the community structure of a given network as closely as possible. However, most popular algorithms for community detection in graphs have one serious drawback, namely, they are heuristic-based and in many cases are unable to find a near-optimal solution. Moreover, their results are volatile, impacting the replicability of their results. In this paper, we investigate if the performance of greedy algorithms might be improved by initialising such algorithms with some carefully chosen partition of nodes, namely a partition obtained by embedding the nodes into real numbers space and then running a clustering algorithm on this latent representation. We believe that embedding will filter unwanted noise while retaining the proximity of nodes belonging to the same community or will learn more complex and elusive relations between nodes. Then, clustering algorithms run on this embedding will create a stable partitioning that will reduce the uncertainty in the initial phases of the community detection algorithms. The experiments show that the proposed procedure significantly improves the results over baseline community detection algorithms, namely Louvain and Leiden. It also reduces the inherent volatility of such algorithms. The impact depends on the given graph’s properties, especially the strength of the community structure and degree distribution. The largest boost in performance is given in the cases when networks are ‘noisier’, that is, when the community structure is less pronounced and there are many connections between communities. Furthermore, the design and parametrization of the procedure depend on the network’s topology, not on the size of the network itself.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.