Data Warehousing Process Modeling from Classical Approaches to New Trends: Main Features and Comparisons

Asma Dhaouadi, Khadija Bousselmi, Sébastien Monnet, Mohamed Mohsen Gammoudi, Slimane Hammoudi

doi:10.3390/data7080113

Abstract

The extract, transform, and load (ETL) process is at the core of data warehousing architectures. As such, the success of data warehouse (DW) projects depends largely on the proper modeling of the ETL process. As there is no standard model for the representation and design of this process, several researchers have proposed modeling methods based on different formalisms, such as unified modeling language (UML), ontology, model-driven architecture (MDA), model-driven development (MDD), and graphical flow, which includes Business Process Model and Notation (BPMN), colored Petri nets (CPN), Yet Another Workflow Language (YAWL), CommonCube, entity modeling diagram (EMD), and so on. With the emergence of Big Data, and despite the multitude of relevant approaches proposed for modeling the ETL process in classical environments, part of the community has been motivated to provide new data warehousing methods that support Big Data specifications. In this paper, we present a summary of relevant works related to the modeling of data warehousing approaches, from classical ETL processes to ELT design approaches. A systematic literature review is conducted and a detailed set of comparison criteria is defined in order to allow the reader to better understand the evolution of these processes. Our study paints a complete picture of ETL modeling approaches, from their advent to the era of Big Data, while comparing their main characteristics. This study allows for the identification of the main challenges and issues related to the design of Big Data warehousing systems, mainly involving the lack of a generic design model for data collection, storage, processing, querying, and analysis.

Journal: Data
Publication Date: Aug 12, 2022
Citations: 13
License type: CC BY 4.0

Similar Papers

Transformation of BPMN Diagrams to YAWL Nets
Jianhong Ye ... Wen Song
Journal of Software | VOL. 5
04 Jan 2010

Extraction, Transformation, and Loading
Alejandro Vaisman ... Esteban Zimányi
01 Jan 2014

Applying MDA to the development of data warehouses
Jose-Norberto Mazon ... Juan Trujillo
04 Nov 2005

A model-driven framework for ETL process development
Zineb El Akkaoui ... Esteban Zimányi
28 Oct 2011
