Which Category Is Better: Benchmarking Relational and Graph Database Management Systems

Yijian Cheng,Wei Lu,Pengjie Ding,Xiaoyong Du,Tongtong Wang

doi:10.1007/s41019-019-00110-3

Yijian Cheng, Wei Lu + Show 3 more

Open Access

https://doi.org/10.1007/s41019-019-00110-3

Copy DOI

Journal: Data Science and Engineering	Publication Date: Nov 11, 2019
Citations: 30	License type: open-access

Affiliation: Renmin University of China

Abstract

Over decades, relational database management systems (RDBMSs) have been the first choice to manage data. Recently, due to the variety properties of big data, graph database management systems (GDBMSs) have emerged as an important complement to RDBMSs. As pointed out in the existing literature, both RDBMSs and GDBMSs are capable of managing graph data and relational data; however, the boundaries of them still remain unclear. For this reason, in this paper, we first extend a unified benchmark for RDBMSs and GDBMSs over the same datasets using the same query workload under the same metrics. We then conduct extensive experiments to evaluate them and make the following findings: (1) RDBMSs outperform GDMBSs by a substantial margin under the workloads which mainly consist of group by, sort, and aggregation operations, and their combinations; (2) GDMBSs show their superiority under the workloads that mainly consist of multi-table join, pattern match, path identification, and their combinations.

Highlights

From the perspective of relational database management systems (RDBMSs), RDBMSs have been proved to be capable of dealing with graph data management and analysis, and a few recent works have shown that by extending SQL languages in RDBMSs to support graph operations, the performance is comparable between RDBMSs and graph database management systems (GDBMSs) [10,11,12,13,14], verifying this capacity of RDBMSs
– We extend a unified benchmark for both RDBMSs and GDBMSs to evaluate them under the same datasets as well as the same metrics
To gain a better understanding, we further provide two examples of the transformations from primitive operations in RDBMSs to their counterpart in GDBMSs which are shown in Algorithm 1 and Algorithm 2, respectively

Summary

Introduction

From the perspective of GDBMSs, their advantage lies in schema-less property They are able to manage structured, unstructured, or semi-structured data and are more flexible than RDBMSs. Given that GDBMSs are able to manage relational data, it is argued that RDBMSs may be replaced by GDBMSs. On the other hand, from the perspective of RDBMSs, RDBMSs have been proved to be capable of dealing with graph data management and analysis, and a few recent works have shown that by extending SQL languages in RDBMSs to support graph operations, the performance is comparable between RDBMSs and GDBMSs [10,11,12,13,14], verifying this capacity of RDBMSs. due to the lack of a unified graph model and query language across GDBMSs, which often incurs an extra programming and maintenance overhead for users, the necessity of GDBMSs is disputed.

Related Work

A Unified Benchmark for RDBMSs and GDBMSs

Data Generation Schemes

Datasets to be Used

50 MB 100 MB 500 MB 1 GB

Query Workload

Atomic Relational Queries

TPC‐H Query Workloads

11: RETURN

Graph Query workload

13: TRUNCATE Rcount

Representatives of RDBMSs and GDBMSs to be Compared

Metrics

Experimental Setup

Back‐end Storage Engines Evaluation

Relational Operation Evaluation

Graph Algorithm Evaluation

Conclusion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Which Category Is Better: Benchmarking Relational and Graph Database Management Systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering

Lead the way for us

Similar Papers

Which Category Is Better: Benchmarking the RDBMSs and GDBMSs
Pengjie Ding ... Yijian Cheng
-
Pengjie Ding, et. al.Pengjie Ding ... Yijian Cheng
01 Jan 2019
01 Jan 2019

(Graph Database: A Survey)
Amitabha Bhattacharyya ... Durgapada Chakravarty
-
Amitabha Bhattacharyya, et. al.Amitabha Bhattacharyya ... Durgapada Chakravarty
01 Jan 2020
01 Jan 2020

RDF Query Processing: Relational Vs. Graph Approach
Ami Pandat ... Minal Bhise
-
Ami Pandat, et. al.Ami Pandat ... Minal Bhise
01 Jan 2021
01 Jan 2021

A comparative evaluation of RDBMS and GDBMS for shortest path operations on pedestrian navigation data
Vaios Patras ... Evaggelos Karvounis
-
Vaios Patras, et. al.Vaios Patras ... Evaggelos Karvounis
24 Sep 2021
24 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Which Category Is Better: Benchmarking Relational and Graph Database Management Systems

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering