Abstract

A data warehouse (DW) is a vast repository of data that facilitates decision-making for businesses and companies. This concept dates back to the 1980s and it has been widely accepted. One of the key points for the success of the process of data warehousing lies in the definition of the warehouse model depending on data sources and analysis needs. Once the data warehouse is designed, the content and structure of the data sources, as well as the requirements analysis are required to evolve, therefore, an evolution of the model must take place (diagram and data). In this context, several approaches have been developed to design and implement data warehouses. Nevertheless, there is no standard process that deals with designing all of the data warehouse layers, also, there is no software that encompasses this type of problem. In general, the majority of these approaches focus on a particular aspect of data warehouse such as data storage, ETL process, OLAP, reporting, etc, and does not cover its entire lifecycle. A Model-Driven Architecture (MDA) is a standard approach, its aims to support all phases of software manufacturing by promoting the use of models and the transformations between them. Moreover, this approach aims to automate the process of software engineering, thereby decreasing the cost of software development and enhancing its productivity. In this study, we present a systematic review of various works on the data warehouse design methods. We compare and discuss these works according to the criteria that seem relevant for this issue. We present a new design approach for multidimensional schemas construction from relational models using MDA techniques, we also develop the resulting research perspectives.

Highlights

  • Data warehouses (DW) are characterized by a complicated architecture, they are built from transactional sources via ETL (Extract - Transform - Load) processes

  • It is model-based, it allows a higher level of abstraction during development, and allows the separation between platform independent models (PIM) and platform specific models (PSM)

  • ─ PIM (Platform Independent Model): it is a design model which describes the system independently of any technical platform and any technology used to deploy the application, this model represents the business logic specific to the system and it is sustainable over time, it consists of UML class diagrams

Read more

Summary

Introduction

Data warehouses (DW) are characterized by a complicated architecture, they are built from transactional sources via ETL (Extract - Transform - Load) processes. In practice, DWs are characterized by a dynamic that affects the stored data and their structures. It is iJOE ‒ Vol 17, No 10, 2021. We will review the existing literature dealing with the problem of data warehousing, so we can respond to the various limits that block the standardization of the MDA approach. The structure of this present study is as follows. The last section presents the conclusions along with future work

Decision support system
Data Warehouse
MDA: Model Driven Architecture
Systematic Review
Reviews Results
Findings and Analysis
Proposal of an MDA approach for the development of data warehouses
Conclusion and future work
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.