Abstract

Community detection involves grouping the nodes of a network such that nodes in the same community are more densely connected to each other than to the rest of the network. Previous studies have focused mainly on identifying communities in networks using node connectivity. However, each node in a network may be associated with many attributes. Identifying communities in networks combining node attributes has become increasingly popular in recent years. Most existing methods operate on networks with attributes of binary, categorical, or numerical type only. In this study, we introduce kNN-enhance, a simple and flexible community detection approach that uses node attribute enhancement. This approach adds the k Nearest Neighbor (kNN) graph of node attributes to alleviate the sparsity and the noise effect of an original network, thereby strengthening the community structure in the network. We use two testing algorithms, kNN-nearest and kNN-Kmeans, to partition the newly generated, attribute-enhanced graph. Our analyses of synthetic and real world networks have shown that the proposed algorithms achieve better performance compared to existing state-of-the-art algorithms. Further, the algorithms are able to deal with networks containing different combinations of binary, categorical, or numerical attributes and could be easily extended to the analysis of massive networks.

Highlights

  • Existing methods can be classified roughly into two categories

  • We can obtain a k Nearest Neighbor (kNN) graph by using a set of node attributes

  • The kNN-graph is combined with the original network to compensate for sparsity, thereby strengthening the community structure of the network

Read more

Summary

Introduction

The first category is composed of probabilistic generative models that formulate joint models of link connections and node attributes, and that use the models to infer the posterior community memberships of nodes in a network[8,9,10,11,12,13,14,15,16,17]. Most of the methods mentioned above follow the assumption that cluster memberships related to node attributes must be consistent with community memberships determined by link structure for a network. It is not always true in real world networks. The approach can handle large-scale attributed networks by combining fast approximate kNN-graph algorithms[35,36,37] with fast community detection algorithms such as BGLL38 and Informap[39]

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.