Abstract

For a given pair of pattern and data graphs, the subgraph isomorphism finding problem locates all instances of the pattern graph into the data graph. For a given subgraph isomorphic image of the pattern graph in a data graph, the set of all ordered pairs of the pattern graph’s vertices and their respective images data graph is called an embedding. Many solvers, such as $\mathrm {Turbo_{ISO}}$ , Glasgow , and VF3 exist in the literature for subgraph isomorphism finding problem. Though each solver aims to minimize computing costs in its own way, computational efficiency is still a central issue for the subgraph isomorphism finding problem. In this paper, we present the development of an efficient solver, SubGlw , for subgraph isomorphism finding which first decomposes data graph into small-size candidate subgraphs using a ranking function and then searches the embeddings of the pattern graph in each of them separately. The ranking function is designed in such a way that it minimizes both number and size of the candidate subgraphs. The performance of SubGlw is empirically evaluated and compared with two state-of-the-art subgraph isomorphism solvers – SubISO and Glasgow over three benchmark datasets – Yeast , Human , and Hprd . The experimental findings reveal that SubGlw performs significantly better in terms of both embedding count and execution time . We have also presented an analysis for identifying saddle point , which is a timeout at which our solver achieves maximum embeddings in least execution time. This analysis provides a better understanding for parameter settings. The source codes of SubGlw can be downloaded from https://github.com/ZubairAliIgraph/SubGlw-master .

Highlights

  • A graph is a well-known non-linear data structure which is used in many application areas, such as chemistry, biology, network science, and pattern recognition

  • We have presented an analysis to determine saddle point for the proposed subgraph isomorphism solver which helps in parameter settings

  • WORK In this paper, we have proposed the development of an efficient solver for subgraph isomorphism finding in large graphs

Read more

Summary

INTRODUCTION

A graph is a well-known non-linear data structure which is used in many application areas, such as chemistry, biology, network science, and pattern recognition. Subgraph isomorphism finding problem is to identify all instances of a pattern graph in a given data graph, wherein generally the size. The subgraph isomorphism solver identifies all embeddings of the pattern graph into the candidate subgraphs and combines them together to form the solution of the subgraph isomorphism finding problem. A. OUR CONTRIBUTIONS In this paper, we have proposed a subgraph isomorphism finding solver SubGlw. In line with [7], the proposed solver decomposes a data graph into several candidate subgraphs using a ranking function, which optimizes the size and count of the candidate subgraphs. We have presented an analysis to determine saddle point for the proposed subgraph isomorphism solver which helps in parameter settings.

RELATED WORKS
PROPOSED METHOD
PIVOT VERTEX SELECTION
EXPERIMENTAL SETUP AND RESULTS
Findings
CONCLUSION AND FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call