Abstract

Community search has been extensively studied in large networks, such as Protein-Protein Interaction (PPI) networks, citation graphs, and collaboration networks. However, in terms of widely existing multi-valued networks, where each node has d (d ≥ 1) numerical attributes, almost all existing algorithms either completely ignore the attributes of node at all or only consider one attribute. To solve this problem, the concept of skyline community was presented, based on the concepts of k-core and skyline recently. The skyline community is defined as a maximal k-core that satisfies some influence constraints, which is very useful in depicting the communities that are not dominated by other communities in multi-valued networks. However, the algorithms proposed on skyline community search can only work in the special case that the nodes have different values on each attribute, and the computation complexity degrades exponentially as the number of attributes increases. In this work, we turn our attention to the general scenario where multiple nodes may have the same attribute value. Specifically, we first present an algorithm, called MICS, which can find all skyline communities in a multi-valued network. To improve computation efficiency, we then propose a dimension reduction based algorithm, called P-MICS, using the maximum entropy method. Our algorithm can significantly reduce the skyline community searching time, while is still able to find almost all cohesive skyline communities. Extensive experiments on real-world datasets demonstrate the efficiency and effectiveness of our algorithms.

Highlights

  • Community search is a fundamental problem in network analysis, which has attracted much attention recently due to its wide applications, such as protein structure analysis[1], organization of activities[2], and advertising[3]

  • The network with d numerical attributes is modeled as a multi-valued graph G D .V; E; A/, where V, E, and A represent the sets of nodes, edges, and d -dimensional vectors associated to nodes, respectively

  • Each node is associated with d attribute value, and the size of these values indicates its importance

Read more

Summary

Introduction

Community search is a fundamental problem in network analysis, which has attracted much attention recently due to its wide applications, such as protein structure analysis[1], organization of activities[2], and advertising[3]. Due to the fact that in real-world applications, it usually only needs to search important communities, i.e., those with high cohesiveness, it admits to improve the computation efficiency by ignoring communities that are less important. With this intuition, we present an algorithm, called P-MISC, which can significantly reduce the computation time of the skyline community search. The algorithm adopts the maximum entropy method to discover a small number of crucial attributes, which can reserve the information of the network as much as possible This approach can help find almost all high cohesive skyline communities.

Related Work
Algorithm for dD2
Algorithm for d 3
Algorithm with dimension reduction
Efficiency evaluation
Findings
Quantity evaluation
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call