Ap-FSM: A parallel algorithm for approximate frequent subgraph mining using Pregel

Vandana Bhatia,Rinkle Rani

doi:10.1016/j.eswa.2018.04.010

Abstract

Large graphs are scale-free, ubiquitous having irregular relationships and non-trivial topology. Frequent subgraph mining is a popular method for knowledge extraction from graphs. Most of the existing frequent subgraph mining algorithms are centralized algorithms that cannot handle a single large graph efficiently and incur high communication cost. However, to make the task of subgraph mining less expensive computationally, approximate subgraph mining can be applied which will capture similar structure subgraphs as of exact subgraph mining. In this paper, we propose an approximate subgraph mining algorithm named Ap-FSM implemented on distributed graph environment Pregel. The working of Ap-FSM is divided into three phases. The first phase selects the representative graph from the original graph while preserving the original graph properties. The second phase efficiently performs subgraph extension. Phase 3 introduces a novel two-step optimization for performing subgraph pruning. Analyzing such large graph data will be beneficial from the perspective of expert and intelligent systems, as discovered patterns can be used for knowledge discovery and decision making. To evaluate the performance of Ap-FSM, experiments are performed over three real life datasets having up to billion edges. The results show that the proposed Ap-FSM significantly outperforms the state-of-art frequent subgraph mining algorithms and overcome the challenges of performing frequent subgraph mining on a massive large graph. It is also shown that Ap-FSM achieves high scalability and speedup in distributed graph environment and is highly accurate in finding frequent subgraphs from a single large graph.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ap-FSM: A parallel algorithm for approximate frequent subgraph mining using Pregel

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Journal: Expert Systems With Applications	Publication Date: Apr 9, 2018
Citations: 21

Similar Papers

FSM-BC-BSP: Frequent Subgraph Mining Algorithm Based on BC-BSP
Fangling Leng ... Fan Li
Applied Sciences | VOL. 14
Fangling Leng, et. al.Fangling Leng ... Fan Li
09 Apr 2024
Applied Sciences | VOL. 14

Dynamic frequent subgraph mining algorithms over evolving graphs: a survey
Belgin Ergenç Bostanoğlu ... Nourhan Abuzayed
PeerJ Computer Science | VOL. 10
Belgin Ergenç Bostanoğlu, et. al.Belgin Ergenç Bostanoğlu ... Nourhan Abuzayed
08 Oct 2024
PeerJ Computer Science | VOL. 10

Subgraph mining in a large graph: A review
Lam B Q Nguyen ... Vaclav Snasel
WIREs Data Mining and Knowledge Discovery | VOL. 12
Lam B Q Nguyen, et. al.Lam B Q Nguyen ... Vaclav Snasel
08 Mar 2022
WIREs Data Mining and Knowledge Discovery | VOL. 12

HOPS: Probabilistic Subtree Mining for Small and Large Graphs
Pascal Welke ... Michael Kamp
-
Pascal Welke, et. al.Pascal Welke ... Michael Kamp
20 Aug 2020
20 Aug 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ap-FSM: A parallel algorithm for approximate frequent subgraph mining using Pregel

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications