An Efficient Method for Speeding up Large-Scale Data Transfer Process to Database: A Case Study

Ginanjar Wiro Sasmito, M Nishom

doi:10.14569/ijacsa.2019.0101255

Abstract

Of the four characteristics of big data complexity, namely volume, velocity, variety, and veracity (the 4Vs), this paper focuses on volume, with the aim of improving the performance of data extract, transform, and load processes during data migration from one server to another, a migration made necessary by updates to the population data of Tegal City. Most programmers in the Department of Population and Civil Registration of Tegal City carry out the transfer by sending all available data (in a specific file format) to the database server, regardless of file size. This approach is prone to errors that can disrupt the transfer, such as timeouts, oversized data packets, or lengthy execution times caused by the large data size. The research compares several approaches to extracting, transforming, and loading large data into a new server database, using the command line and native PHP (object-oriented and procedural styles) with three file formats: SQL, XML, and CSV. Our performance analysis showed that, among the methods compared, large-scale data transfer using the LOAD DATA INFILE statement with a comma-separated values (CSV) data source was the fastest and most effective, and it is therefore the recommended method.
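To make the recommended approach concrete, the following is a minimal sketch in Python (not the PHP or command-line code used in the study) of how a CSV file and a matching MySQL LOAD DATA INFILE statement might be prepared. The table name `residents`, the column names, and the helper functions are hypothetical and do not come from the paper. The LOCAL keyword is also an assumption: it makes the client read the file, whereas the paper names only LOAD DATA INFILE.

```python
import csv


def write_csv(path, header, rows):
    """Write a header line and data rows to a CSV file using '\n' line endings."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f, lineterminator="\n")
        writer.writerow(header)
        writer.writerows(rows)


def build_load_statement(csv_path, table, columns, local=True):
    """Build a MySQL LOAD DATA INFILE statement for a CSV file with a header row.

    The table and column names are hypothetical inputs; only simple
    identifiers are accepted so they can be safely wrapped in backticks.
    """
    for name in [table, *columns]:
        if not name.isidentifier():
            raise ValueError(f"unsafe identifier: {name!r}")
    # Escape backslashes and single quotes for the SQL string literal.
    path_literal = str(csv_path).replace("\\", "\\\\").replace("'", "\\'")
    column_list = ", ".join(f"`{c}`" for c in columns)
    keyword = "LOAD DATA LOCAL INFILE" if local else "LOAD DATA INFILE"
    return (
        f"{keyword} '{path_literal}' "
        f"INTO TABLE `{table}` "
        "FIELDS TERMINATED BY ',' OPTIONALLY ENCLOSED BY '\"' "
        "LINES TERMINATED BY '\\n' "
        "IGNORE 1 LINES "  # skip the header line written by write_csv
        f"({column_list})"
    )


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    write_csv("residents.csv", ["resident_id", "full_name"], [["1", "Example, Name"]])
    print(build_load_statement("residents.csv", "residents", ["resident_id", "full_name"]))
```

The generated statement would then be executed through a MySQL client or driver. With LOCAL, the `local_infile` option must be enabled on both the client and the server. Without it, the file must be readable by the server itself. The whole file is loaded by a single statement, which is the contrast with row-by-row INSERT approaches that the abstract draws.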

Journal: International Journal of Advanced Computer Science and Applications
Publication Date: Jan 1, 2019
License type: cc-by

