Abstract

Processing a reachability query in large-scale networks using existing methods remains one of the most challenging problems in graph mining. In this paper, we propose a novel incremental algorithmic framework for arbitrary-order reachability computation in massive graphs. The proposed method is intuitive and significantly outperforms the currently known methods in terms of computation time. We focus on the arbitrary-order reachability matrix framework called AORM, which can handle directed and disconnected networks such as citation networks. The AORM can handle diverse types of real-world datasets. We conduct extensive experimental studies with twenty synthetic networks generated from five random graph generation models and twenty massive real-world networks. The experimental results show the advantages of the method in terms of both computational efficiency and approximation controllability. In particular, the proposed method outperforms up to 10 times compared to NetworkX for incremental all-pairs shortest paths computation. Moreover, the computational results of the method rapidly converge to the ground truths. Thus, we can get the correct solution in the early stage of the incremental approximation. We can employ the method as a versatile feature extraction framework for network embedding. Overall, the experimental results present a remarkable improvement in speed-up for reachability computation.

Highlights

  • T HE World Health Organization (WHO) declared COVID-19 a pandemic caused by the new SARS-CoV2 virus in March 2020

  • THE algorithmic framework (AORM) ALGORITHM we introduce our proposed AORM-based three algorithms for computing reachability and all-pairs shortest paths (APSP) on directed, disconnected graphs

  • We present incremental AORM to utilize the summation of non-zero elements in the hierarchical reachability matrix based on neighborhood information

Read more

Summary

Introduction

T HE World Health Organization (WHO) declared COVID-19 a pandemic caused by the new SARS-CoV2 virus in March 2020. COVID-19 has formed a new scientific community to address the pandemic, unlike other recent crises. Many data analysts and researchers in modern graph analytics utilize new pandemic analysis methods and recent artificial intelligence approaches to tackle this problem. Most researchers often employ adjacency matrices as representations of these graphs to perform fundamental graph analytics operations, such as reachability queries, all-pairs shortest paths (APSP), and so on. In most practical applications, adjacency matrices of massive graphs are usually sparse [2]–[4]. These applications appropriate unweighted edges rather than weighted edges to describe relationships between vertices in a graph

Objectives
Methods
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call