Abstract

Discovery of communities in complex networks is a fundamental data analysis problem with applications in various domains. While most of the existing approaches have focused on discovering communities of nodes, recent studies have shown the advantages and uses of link community discovery in networks. Generative models provide a promising class of techniques for the identification of modular structures in networks, but most generative models mainly focus on the detection of node communities rather than link communities. In this work, we propose a generative model, which is based on the importance of each node when forming links in each community, to describe the structure of link communities. We proceed to fit the model parameters by taking it as an optimization problem, and solve it using nonnegative matrix factorization. Thereafter, in order to automatically determine the number of communities, we extend the above method by introducing a strategy of iterative bipartition. This extended method not only finds the number of communities all by itself, but also obtains high efficiency, and thus it is more suitable to deal with large and unexplored real networks. We test this approach on both synthetic benchmarks and real-world networks including an application on a large biological network, and compare it with two highly related methods. Results demonstrate the superior performance of our approach over competing methods for the detection of link communities.

Highlights

  • Many complex systems in the real world exist in the form of networks, such as social networks, biological networks, Web networks, etc., which are collectively referred to as complex networks

  • One of the main problems in the study of complex networks is the detection of community structure [1], a subject that keeps attracting a great deal of interest

  • For instance, most individuals belong to multiple communities such as families, friends, and coworkers, while the link between a pair of individuals often exists for a dominant reason which may represent family ties, friendship, or professional relationships, et al the links connected to a single node may belong to several different link communities, the node can be assigned to multiple communities of links

Read more

Summary

Introduction

Many complex systems in the real world exist in the form of networks, such as social networks, biological networks, Web networks, etc., which are collectively referred to as complex networks. The motivation is that link communities are more intuitive than node communities in many real-world networks. This is due to the link usually having a unique identity, while the node tends to have multiple roles. For instance, most individuals belong to multiple communities such as families, friends, and coworkers, while the link between a pair of individuals often exists for a dominant reason which may represent family ties, friendship, or professional relationships, et al the links connected to a single node may belong to several different link communities, the node can be assigned to multiple communities of links. Overlapping communities of nodes, which is another attractive topic in community detection [11], could be detected as a natural byproduct of link communities

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call