Abstract

It is possible to extract valuable insights about the functional properties of a system by identifying and inspecting the community structure in the network that models the system. Community detection aims to extract these community structures from networks. Many community detection methods have been proposed that consider the problem from different perspectives. However, with the emergence of very large and complex networks from variety of domains, there has been a growing need for community detection methods that can operate at scale effectively and efficiently. Considering this, we propose a novel algorithm for large-scale community detection, based on two novel similarity indices we propose as well. In the first stage of our proposed algorithm, we generate candidate communities using a mechanism similar to information propagation very rapidly. Then, we merge small candidates that have fewer nodes than a calculated threshold with the larger ones using similarity between nodes and communities. Next, we engage a refinement operation on the candidates by moving all nodes to the candidates to which they are most similar using the same similarity index again. After that, we merge small communities with larger ones by using the similarity between communities until no gain in the modularity is obtained. Finally, in the last stage, we employ the same refinement operation as in the third stage. With an extensive experimentation on real-world and artificially-generated benchmark networks, we demonstrate and verify the performance and effectiveness of the proposed algorithm comparing it with the state-of-the-art methods. Experimental results indicate that our algorithm scales very well with growing size and complexity of networks. Besides, our algorithm outperforms most state-of-the-art community detection methods both in detection performance and computation time.

Highlights

  • Any complex system that is made of a number of interacting components can be modeled with a network structure, where nodes represent the components and edges represent the interactions among these components [1]

  • 1) REAL-WORLD NETWORKS very small in general, we considered the use of five real-world networks with ground-truth communities, which are commonly used as benchmarks in network community detection research

  • Since they require the number of communities to detect in advance, KM and Bisecting K-Means (BKM) are not commonly used for network community detection even though there are a large number of studies that utilize them in network analysis domain

Read more

Summary

INTRODUCTION

Any complex system that is made of a number of interacting components can be modeled with a network structure, where nodes represent the components and edges represent the interactions among these components [1] This makes network models a highly useful tool to study many phenomena in social sciences, economics, biology, ecology, logistics, communication, and so on along with the advances in computing technology and the techniques developed to analyze especially large and complex networks under the interdisciplinary research field of network science [2]. We observe a natural tendency to form dense groups of nodes that are typically denser than the whole network These dense structures are called network communities, and.

Tunali
RELATED WORK
PROPOSED ALGORITHM
SIMILARITY INDICES
EXPERIMENTAL SETTINGS
COMMUNITY DETECTION VALIDITY METRICS
EXPERIMENTAL PROCEDURE
EXPERIMENTAL RESULTS AND DISCUSSION
CONCLUSION AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call