Efficient structure similarity searches: a partition-based approach

Xiang Zhao, Yang Wang, Wenjie Zhang, Xuemin Lin, Chuan Xiao

doi:10.1007/s00778-017-0487-0

Abstract

Graphs are widely used to model complex data in many applications, such as bioinformatics, chemistry, social networks, and pattern recognition. A fundamental and critical query primitive is to efficiently search for similar structures in a large collection of graphs. This article mainly studies threshold-based graph similarity search with edit distance constraints. Existing solutions to the problem utilize fixed-size overlapping substructures to generate candidates, and thus become susceptible to large vertex degrees and distance thresholds. In this article, we present a partition-based approach to tackle the problem. By dividing data graphs into variable-size non-overlapping partitions, the edit distance constraint is converted to a graph containment constraint for candidate generation. We develop efficient query processing algorithms based on the novel paradigm. Moreover, candidate-pruning techniques and an improved graph edit distance verification algorithm are developed to boost the performance. In addition, a cost-aware graph partitioning method is devised to optimize the index. Extending the partition-based filtering paradigm, we present a solution to the top-k graph similarity search problem, where tailored filtering, look-ahead, and computation-sharing strategies are exploited. Using both public real-life and synthetic datasets, extensive experiments demonstrate that our approaches significantly outperform the baseline and its alternatives.
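
The conversion from an edit distance constraint to a containment constraint can be read as a pigeonhole argument: if a data graph is split into tau + 1 non-overlapping partitions and its edit distance to the query is at most tau, the edit operations can touch at most tau partitions, so at least one partition must be contained in the query unchanged. The Python sketch below illustrates only this filtering idea. It is not the authors' implementation. The function names `contains` and `is_candidate`, the graph representation, the omission of edge labels, the dropping of cross-partition edges, and the brute-force containment test are all assumptions made for illustration.

```python
from itertools import permutations

# Assumed representation: a graph is (labels, edges), where labels maps
# vertex -> label and edges is a list of undirected (u, v) pairs.
# Edge labels are omitted to keep the sketch short.

def contains(part, query):
    """Brute-force test: is `part` a label-preserving (non-induced)
    subgraph of `query`? Exponential; only suitable for tiny graphs."""
    p_labels, p_edges = part
    q_labels, q_edges = query
    q_edge_set = {frozenset(e) for e in q_edges}
    p_vertices = list(p_labels)
    # Try every injective mapping of partition vertices to query vertices.
    for image in permutations(q_labels, len(p_vertices)):
        mapping = dict(zip(p_vertices, image))
        if any(p_labels[v] != q_labels[mapping[v]] for v in p_vertices):
            continue  # vertex labels disagree
        if all(frozenset((mapping[u], mapping[v])) in q_edge_set
               for u, v in p_edges):
            return True  # every partition edge is present in the query
    return False

def is_candidate(partitions, query, tau):
    """Keep a data graph only if at least one of its tau + 1 partitions
    is contained in the query; otherwise its edit distance exceeds tau."""
    if len(partitions) != tau + 1:
        raise ValueError("expected tau + 1 partitions")
    return any(contains(p, query) for p in partitions)

# Hypothetical data graph: the labeled path C-C-O-N, split for tau = 1
# into two partitions (the cross edge between vertices 1 and 2 is dropped).
partitions = [
    ({0: "C", 1: "C"}, [(0, 1)]),
    ({2: "O", 3: "N"}, [(2, 3)]),
]

# Query C-C-O-S: one relabeling away from the data graph.
q_near = ({"a": "C", "b": "C", "c": "O", "d": "S"},
          [("a", "b"), ("b", "c"), ("c", "d")])
# Query C-O-C-S: neither partition appears in it.
q_far = ({"a": "C", "b": "O", "c": "C", "d": "S"},
         [("a", "b"), ("b", "c"), ("c", "d")])

near = is_candidate(partitions, q_near, tau=1)  # kept for verification
far = is_candidate(partitions, q_far, tau=1)    # pruned without verification
```

A graph that passes this test is only a candidate and still needs an exact edit distance computation. A graph that fails it can be discarded safely, because dropping cross-partition edges weakens the filter but never causes a false dismissal.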

Journal: The VLDB Journal
Publication Date: Oct 24, 2017
Citations: 39

Similar Papers

A partition-based approach to structure similarity search
Xiang Zhao ... Wenjie Zhang
Proceedings of the VLDB Endowment | VOL. 7
01 Nov 2013

An improved global lower bound for graph edit similarity search
Karam Gouda ... Mona Arafa
Pattern Recognition Letters | VOL. 58
26 Feb 2015

G-Hash: Towards Fast Kernel-based Similarity Search in Large Graph Databases.
Xiaohong Wang ... Aaron Smalter
Advances in database technology : proceedings. International Conference on Extending Database Technology | VOL. 360
24 Mar 2009

Graph similarity search with edit distance constraint in large graph databases
Weiguo Zheng ... Lei Zou
27 Oct 2013
