Optimization of ETL Process in Data Warehouse Through a Combination of Parallelization and Shared Cache Memory

M Faridi Masouleh,A Toloie Eshlaghy,M Alborzi,M A Afshar Kazemi

doi:10.48084/etasr.849

Abstract

Extraction, Transformation and Loading (ETL) is introduced as one of the notable subjects in optimization, management, improvement and acceleration of processes and operations in data bases and data warehouses. The creation of ETL processes is potentially one of the greatest tasks of data warehouses and so its production is a time-consuming and complicated procedure. Without optimization of these processes, the implementation of projects in data warehouses area is costly, complicated and time-consuming. The present paper used the combination of parallelization methods and shared cache memory in systems distributed on the basis of data warehouse. According to the conducted assessment, the proposed method exhibited 7.1% speed improvement to kattle optimization instrument and 7.9% to talend instrument in terms of implementation time of the ETL process. Therefore, parallelization could notably improve the ETL process. It eventually caused the management and integration processes of big data to be implemented in a simple way and with acceptable speed.

Highlights

Toloie EshlaghyAbstract—Extraction, Transformation and Loading (ETL) is introduced as one of the notable subjects in optimization, management, improvement and acceleration of processes and operations in data bases and data warehouses
Data warehouse applications have utilized Extraction, Transformation and Loading (ETL) processes through tools that extract data from data resources, transform them to an acceptable format and load them in a data provider [1]
With regard to the examination of weak and strong points of former researches, the present paper has presented a new combined method by usage of parallelization techniques and simultaneous use of multiple cores to process and manage different databases in scattered locations as well as the application of cache memory shared between cores which conduct the operations of implementation, transformation and loading of data from distributed data bases in different locations and main data warehouse located in a definite place

Summary

Toloie Eshlaghy

Abstract—Extraction, Transformation and Loading (ETL) is introduced as one of the notable subjects in optimization, management, improvement and acceleration of processes and operations in data bases and data warehouses. The creation of ETL processes is potentially one of the greatest tasks of data warehouses and so its production is a time-consuming and complicated procedure. Without optimization of these processes, the implementation of projects in data warehouses area is costly, complicated and time-consuming. Parallelization could notably improve the ETL process It eventually caused the management and integration processes of big data to be implemented in a simple way and with acceptable speed

INTRODUCTION

CONCEPTS OF ETL

Extraction phase

Transformation phase

Loading Phase

Meta Data

ARCHITECTURE AND ANALYSIS OF THE RECOMMENDED

Shared Cache Memory

EVALUATION

CONCLUSION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Engineering, Technology & Applied Science Research	Publication Date: Dec 18, 2016
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

Optimization of ETL Process in Data Warehouse Through a Combination of Parallelization and Shared Cache Memory

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Engineering, Technology & Applied Science Research

Lead the way for us

Similar Papers

DESAIN ETL DENGAN CONTOH KASUS PERGURUAN TINGGI
Spits Warnars
Jurnal Informatika | VOL. 10
Spits WarnarsSpits Warnars
01 Oct 2011
Jurnal Informatika | VOL. 10

ETL Process in a Federal Educational Institution : Obtaining Functional Information and Geolocation of Retired Servers
Edivaldo Da Silva Souza ... Luiz Antonio Abrantes
-
Edivaldo Da Silva Souza, et. al.Edivaldo Da Silva Souza ... Luiz Antonio Abrantes
23 Jun 2021
23 Jun 2021

A shared context approach for supporting experts in data ETL (Extraction, Transformation and Loading) processes
Hassane Tahir ... Patrick Brezillon
-
Hassane Tahir, et. al.Hassane Tahir ... Patrick Brezillon
01 Nov 2011
01 Nov 2011

CAWE DW Documenter: A Model-Driven Tool for Customizable ETL Documentation Generation
Robert Krawatzeck ... Marcus Hofmann
-
Robert Krawatzeck, et. al.Robert Krawatzeck ... Marcus Hofmann
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimization of ETL Process in Data Warehouse Through a Combination of Parallelization and Shared Cache Memory

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Engineering, Technology &amp; Applied Science Research

More From: Engineering, Technology & Applied Science Research