TAPER: query-aware, partition-enhancement for large, heterogenous graphs

Hugo Firth,Paolo Missier

doi:10.1007/s10619-017-7196-y

Abstract

Graph partitioning has long been seen as a viable approach to addressing Graph DBMS scalability. A partitioning, however, may introduce extra query processing latency unless it is sensitive to a specific query workload, and optimised to minimise inter-partition traversals for that workload. Additionally, it should also be possible to incrementally adjust the partitioning in reaction to changes in the graph topology, the query workload, or both. Because of their complexity, current partitioning algorithms fall short of one or both of these requirements, as they are designed for offline use and as one-off operations. The TAPER system aims to address both requirements, whilst leveraging existing partitioning algorithms. TAPER takes any given initial partitioning as a starting point, and iteratively adjusts it by swapping chosen vertices across partitions, heuristically reducing the probability of inter-partition traversals for a given path queries workload. Iterations are inexpensive thanks to time and space optimisations in the underlying support data structures. We evaluate TAPER on two different large test graphs and over realistic query workloads. Our results indicate that, given a hash-based partitioning, TAPER reduces the number of inter-partition traversals by sim 80%; given an unweighted Metis partitioning, by sim 30%. These reductions are achieved within eight iterations and with the additional advantage of being workload-aware and usable online.

Highlights

Path queries over labelled graphs are increasingly common in many applications
In this paper we present TAPER, a graph re-partitioning system that is sensitive to evolving query workloads
We have presented TAPER: a practical system for improving path query processing performance in partitioned graph data

Summary

Introduction

Path queries over labelled graphs are increasingly common in many applications. These include fraud detection [27], recommender systems [9] and social analysis [2] amongst others. Such a labelled graph has the form G = (V, E, L V , l), where each vertex v is annotated with a label l(v) ∈ L V from a predefined set L V of labels (e.g. Purchase, Person, etc...). In this work we address the problem of efficiently and incrementally improving path query performance over k−way partitionings of large, heterogeneous, labelled graphs.

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Distributed and Parallel Databases	Publication Date: May 2, 2017
Citations: 9	License type: open-access

R Discovery Prime

R Discovery Prime

TAPER: query-aware, partition-enhancement for large, heterogenous graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Distributed and Parallel Databases

Lead the way for us

Similar Papers

Query-Sensitive Graph Partitioner for Pattern Matching Applications
Li Lu ... Bei Hua
IEEE Access | VOL. 7
Li Lu, et. al.Li Lu ... Bei Hua
01 Jan 2019
IEEE Access | VOL. 7

Parallel and External High Quality Graph Partitioning

-

01 Jan 2019
01 Jan 2019

Enhanced Multilevel Hybrid Algorithm for Graph Partitioning
Annu Arora ... Karanvir Kaur
International Journal of Computer Applications | VOL. 120
Annu Arora, et. al.Annu Arora ... Karanvir Kaur
18 Jun 2015
International Journal of Computer Applications | VOL. 120

Experimental Analysis of Streaming Algorithms for Graph Partitioning
Anil Pacaci ... M Tamer Özsu
-
Anil Pacaci, et. al.Anil Pacaci ... M Tamer Özsu
25 Jun 2019
25 Jun 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TAPER: query-aware, partition-enhancement for large, heterogenous graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Distributed and Parallel Databases