Abstract
Community search is a query request-oriented community detection problem. Given a query node $v$ in network $G$ , the goal of community search is to discover a community in $G$ that contains node $v$ . Traditional algorithms rely on carefully engineered features to measure local neighborhood structures. Designing these features is a time-consuming process that limits their practical application. Motivated by node embedding using deep learning method to learn distributed representations for nodes in networks, we propose a two-stage community search algorithm based on node embedding. To address the drawbacks of existing node embedding methods, we propose a node embedding model with a $CN$ -based random walk (NECNW) based on a skip-gram model in the first stage. Via NECNW, we learn a low-dimensional representation of nodes in networks. In the second stage, we propose a community quality metric $closeness{-}isolation$ ( $CI$ ) based on the learned vectors. Then, we expand the target community by greedy addition of a shell node that has maximum similarity with the current community. We evaluate the proposed algorithm on both real-world and synthetic networks with related community search and node embedding algorithms. The experimental results show that the proposed algorithm is more effective and efficient for community search than other algorithms.
Highlights
Community structure is a common property of complex networks
We adopt the common neighbors (CN ) metric [24] to measure the similarity between nodes and develop a two-stage community search algorithm based on node embedding with a CN -based random walk approach
In the second stage, based on the vector representation of nodes produced by NECNW, we propose a new goodness metric closeness-isolation (CI ) and design a community search algorithm by maximizing this metric
Summary
Community structure is a common property of complex networks. Essentially, a community is a group of nodes that are densely connected internally [1]. We adopt the common neighbors (CN ) metric [24] to measure the similarity between nodes and develop a two-stage community search algorithm based on node embedding with a CN -based random walk approach. In this paper, motivated by the advance of deep learning on networks, we propose a new similarity-based community search algorithm by combining node embedding technique. In contrast to the aforementioned algorithms, we adopt CN [24] to measure the closeness of nodes and propose a node embedding model, NECNW, with a CN -based random walk. We first define the network and present the problem definition of community search and evaluation metrics for measuring the effectiveness of a community search algorithm.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.