Abstract

AbstractStructural graph clustering is an important problem in the domain of graph data management. Given a large graph G, structural graph clustering is to assign vertices to clusters where vertices in the same cluster are densely connected to each other and vertices in different clusters are loosely connected to each other. Due to its importance, many algorithms have been proposed to study this problem. However, no effort focuses on the distributed graph environment. In this paper, we propose a parallel computing framework named SGP (short for Statistics-based Graph Partition) to support large graph clustering under distributed environment. We first use historical clustering information to partition graph into a group of clusters. Based on the partition result, we can properly assign vertexes to different nodes based on connection relationship among vertex. When a clustering request is submitted, we can use properties leading by the partition for efficiently clustering. Finally, we conduct extensive performance studies on large real and synthetic graphs, which demonstrate that our new approach could efficiently support large graph clustering under distributed environment.KeywordsStructural graphClusterDistributedCore-vertex

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.