A Comprehensive Spark-Based Layer for Converting Relational Databases to NoSQL

Manal A Abdel-Fattah,Wael Mohamed,Sayed Abdelgaber

doi:10.3390/bdcc6030071

Manal A Abdel-Fattah, Wael Mohamed + Show 1 more

Open Access

https://doi.org/10.3390/bdcc6030071

Copy DOI

Journal: Big Data and Cognitive Computing	Publication Date: Jun 27, 2022
Citations: 2	License type: CC BY 4.0

Affiliation: Helwan University

Abstract

Currently, the continuous massive growth in the size, variety, and velocity of data is defined as big data. Relational databases have a limited ability to work with big data. Consequently, not only structured query language (NoSQL) databases were utilized to handle big data because NoSQL represents data in diverse models and uses a variety of query languages, unlike traditional relational databases. Therefore, using NoSQL has become essential, and many studies have attempted to propose different layers to convert relational databases to NoSQL; however, most of them targeted only one or two models of NoSQL, and evaluated their layers on a single node, not in a distributed environment. This study proposes a Spark-based layer for mapping relational databases to NoSQL models, focusing on the document, column, and key–value databases of NoSQL models. The proposed Spark-based layer comprises of two parts. The first part is concerned with converting relational databases to document, column, and key–value databases, and encompasses two phases: a metadata analyzer of relational databases and Spark-based transformation and migration. The second part focuses on executing a structured query language (SQL) on the NoSQL. The suggested layer was applied and compared with Unity, as it has similar components and features and supports sub-queries and join operations in a single-node environment. The experimental results show that the proposed layer outperformed Unity in terms of the query execution time by a factor of three. In addition, the proposed layer was applied to multi-node clusters using different scenarios, and the results show that the integration between the Spark cluster and NoSQL databases on multi-node clusters provided better performance in reading and writing while increasing the dataset size than using a single node.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comprehensive Spark-Based Layer for Converting Relational Databases to NoSQL

Abstract

Talk to us

Similar Papers

More From: Big Data and Cognitive Computing

Lead the way for us

Similar Papers

NoSQL Database Technologies
Michael Madison ... Mark Barnhill
Journal of International Technology and Information Management | VOL. 24
Michael Madison, et. al.Michael Madison ... Mark Barnhill
01 Jan 2015
Journal of International Technology and Information Management | VOL. 24

Concurrency versus consistency in NoSQL databases
Sonal Kanungo ... Rustom D Morena
Journal of Autonomous Intelligence | VOL. 7
Sonal Kanungo, et. al.Sonal Kanungo ... Rustom D Morena
28 Dec 2023
Journal of Autonomous Intelligence | VOL. 7

A Comparative Study of NoSQL Databases
...
International Journal of Advanced Research in Computer Science | VOL. 5
, et. al. ...
01 Jan 2014
International Journal of Advanced Research in Computer Science | VOL. 5

An Empirical Study of NoSQL Databases for Big Data
Wen-Chen Hu ... Hung-Jen Yang
-
Wen-Chen Hu, et. al.Wen-Chen Hu ... Hung-Jen Yang
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comprehensive Spark-Based Layer for Converting Relational Databases to NoSQL

Abstract

Talk to us

Similar Papers

More From: Big Data and Cognitive Computing