Abstract

Given a network, local community detection aims at finding the community that contains a set of query nodes (seed nodes). Random walk (RW) based algorithms have shown great success in various local community detection scenarios. Starting from the seed nodes, RW based algorithms continuously sample random walk paths to get the clustering result. However, current RW based algorithms for local clustering are faced with the following two problems. The random walker is insensitive to the community boundary and might have an unbalanced walk. These problems would have a negative effect on the clustering result of RW based algorithms. In this paper, we propose a density sensitive random walk algorithms (DSRW) for local community detection. By integrating the graph density information into the random walk process, the problems are resolved and the clustering result is improved. We provide the convergence proof of DSRW algorithm and perform extensive experiments on both real and synthetic datasets. Results show that our DSRW algorithm has achieved the state-of-the-art result in most scenarios.

Highlights

  • Network is a common representation structure for many real-world relations

  • For the existing random walk based algorithms for local community detection, i.e., random walk with restart (RWR) [9], vertex reinforcement random walk (VRW) [7], Color random walk algorithm (CRW) [20], etc., none of these algorithms can determine whether the random walker agent is at the community boundary position

  • Results show that density sensitive random walk (DSRW) algorithm achieves higher performance compared to the state-of-the-art methods

Read more

Summary

INTRODUCTION

Network is a common representation structure for many real-world relations (e.g. the social network). Random walk (RW) [7], [9], [10], [17], [20], [22] based algorithms are the most prevalent methods used in local community detection In these methods, random walk paths are repeatedly sampled around the seed nodes until a converged random walker visiting probability distribution for nodes is obtained. For the existing random walk based algorithms for local community detection, i.e., RWR [9], VRW [7], CRW [20], etc., none of these algorithms can determine whether the random walker agent is at the community boundary position. The network is colored by the converged visiting probability of the random walker in each algorithm It shows that DSRW algorithm has a more balanced walk inside the community and fewer visits for the nodes outside the community. All the datasets and codes used in this paper will be open source later

RELATED WORK
PRELIMINARIES
EXPERIMENT
CASE STUDY
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.