Efficient access methods for very large distributed graph databases

David Luaces, José R.R. Viqueira, José M. Cotos, Julián C. Flores

doi:10.1016/j.ins.2021.05.047

Open Access

https://doi.org/10.1016/j.ins.2021.05.047

Abstract

Subgraph searching is an essential problem in graph databases, but it is challenging because it involves subgraph isomorphism, an NP-complete problem. Filter-Then-Verify (FTV) methods mitigate this overhead by using an index to prune graphs that cannot match the query in a filtering stage, which reduces the number of subgraph isomorphism evaluations in the subsequent verification stage. Real applications such as molecular substructure searching require subgraph searching over very large databases (tens of millions of graphs). Previous surveys have identified the FTV solutions GraphGrepSX (GGSX) and CT-Index as the best ones for large databases (thousands of graphs); however, they cannot reach reasonable performance on very large ones (tens of millions of graphs). This paper proposes a generic approach for the distributed implementation of FTV solutions. In addition, three previous methods that improve the performance of GGSX and CT-Index are adapted for execution in clusters. The evaluation shows that the resulting solutions reduce filtering time by between 70% and 90% in a centralized configuration, and that they can be used to achieve efficient subgraph searching over very large databases in cluster configurations.
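The Filter-Then-Verify idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and it is not the actual GGSX or CT-Index data structure: it assumes, for illustration only, that the index stores counts of simple label paths with up to three vertices, and it uses a brute-force backtracking check for the verification stage. All function names (`make_graph`, `path_features`, `is_subgraph`, `search`) and the toy database are hypothetical.

```python
from collections import Counter

def make_graph(labels, edge_list):
    """Undirected labeled graph: (vertex -> label dict, set of frozenset edges)."""
    return labels, {frozenset(e) for e in edge_list}

def neighbors(graph, v):
    _, edges = graph
    return [next(iter(e - {v})) for e in edges if v in e]

def path_features(graph, max_len=3):
    """Count simple label paths with up to max_len vertices (assumed feature type)."""
    labels, _ = graph
    counts = Counter()

    def walk(path):
        counts[tuple(labels[v] for v in path)] += 1
        if len(path) < max_len:
            for n in neighbors(graph, path[-1]):
                if n not in path:
                    walk(path + [n])

    for v in labels:
        walk([v])
    return counts

def is_subgraph(query, graph):
    """Verification: brute-force, label-preserving, non-induced subgraph isomorphism."""
    q_labels, q_edges = query
    g_labels, g_edges = graph
    q_vertices = list(q_labels)

    def extend(mapping):
        if len(mapping) == len(q_vertices):
            return True
        u = q_vertices[len(mapping)]
        for v in g_labels:
            if v in mapping.values() or g_labels[v] != q_labels[u]:
                continue
            # Each query edge from u to an already mapped vertex must exist in the graph.
            if all(frozenset({v, mapping[w]}) in g_edges
                   for w in mapping if frozenset({u, w}) in q_edges):
                mapping[u] = v
                if extend(mapping):
                    return True
                del mapping[u]
        return False

    return extend({})

def search(query, database, index):
    """Filtering keeps graphs whose feature counts dominate the query's; then verify."""
    q_feats = path_features(query)
    candidates = [gid for gid, feats in index.items()
                  if all(feats[f] >= c for f, c in q_feats.items())]
    answers = [gid for gid in candidates if is_subgraph(query, database[gid])]
    return candidates, answers

# Toy database (hypothetical data): three small labeled graphs.
database = {
    "g1": make_graph({0: "C", 1: "C", 2: "O"}, [(0, 1), (1, 2)]),
    "g2": make_graph({0: "C", 1: "N"}, [(0, 1)]),
    "g3": make_graph({0: "C", 1: "C", 2: "O"}, [(0, 1)]),
}
index = {gid: path_features(g) for gid, g in database.items()}  # built once, offline
query = make_graph({0: "C", 1: "O"}, [(0, 1)])
candidates, answers = search(query, database, index)
print(candidates, answers)
```

The filter is safe because an injective, label-preserving embedding of the query maps each simple path of the query to a distinct simple path of the graph with the same label sequence, so a graph with fewer occurrences of any query feature cannot contain the query. Only the surviving candidates pay the cost of the isomorphism test.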

Journal: Information Sciences	Publication Date: May 26, 2021
Citations: 6	License type: cc-by-nc-nd

Similar Papers

Out-of-core coherent closed quasi-clique mining from large dense graph databases
Zhiping Zeng ... George Karypis
ACM Transactions on Database Systems | VOL. 32
01 Jun 2007

G-Hash: Towards Fast Kernel-based Similarity Search in Large Graph Databases
Xiaohong Wang ... Jun Huan
Advances in database technology : proceedings. International Conference on Extending Database Technology | VOL. 360
24 Mar 2009

Scalable mining of large disk-based graph databases
Chen Wang ... Wei Wang
22 Aug 2004

DualIso: An Algorithm for Subgraph Pattern Matching on Very Large Labeled Graphs
Matthew Saltz ... John A Miller
01 Jun 2014
