Evaluation of Replication Mechanisms on Selected Database Systems

Tomáš Pohanka,Vilém Pechanec

doi:10.3390/ijgi9040249

Tomáš Pohanka, Vilém Pechanec

Open Access

https://doi.org/10.3390/ijgi9040249

Copy DOI

Abstract

This paper is focused on comparing database replication over spatial data in PostgreSQL and MySQL. Database replication means solving various problems with overloading a single database server with writing and reading queries. There are many replication mechanisms that are able to handle data differently. Criteria for objective comparisons were set for testing and determining the bottleneck of the replication process. The tests were done over the real national vector spatial datasets, namely, ArcCR500, Data200, Natural Earth and Estimated Pedologic-Ecological Unit. HWMonitor Pro was used to monitor the PostgreSQL database, network and system load. Monyog was used to monitor the MySQL activity (data and SQL queries) in real-time. Both database servers were run on computers with the Microsoft Windows operating system. The results from the provided tests of both replication mechanisms led to a better understanding of these mechanisms and allowed informed decisions for future deployment. Graphs and tables include the statistical data and describe the replication mechanisms in specific situations. PostgreSQL with the Slony extension with asynchronous replication synchronized a batch of changes with a high transfer speed and high server load. MySQL with synchronous replication synchronized every change record with low impact on server performance and network bandwidth.

Highlights

Replication is a process of copying and maintaining database objects [1]
The experiment was completed for different types of replication mechanisms in different SDBMS to provide a broader perspective on replication mechanisms
This study aimed to describe and test available database replication solutions that provide replication functionality over spatial data

Summary

Introduction

Replication is a process of copying and maintaining database objects [1]. The replicated databases are monitored for changes and are synchronized when a change is made [1]. Database replication makes a copy of data accessible from many various servers instead of accessing one central server, or enables many servers to behave like one (parallel query processing) [1]. There are typical scenarios where database replications are used, e.g., secure lower latency for long-distance communication, separate raw data storage and user data storage, secure high availability and performance. Replication can be improperly considered as a mechanism for creating data backups. This is not the primary function of database replications. The essence of database replication is to create a redundant, interconnected distributed environment

Objectives

Methods

Results

Conclusion