Smart Intra-query Fault Tolerance for Massive Parallel Processing Databases

Yunhong Ji,Yunpeng Chai,Lipeng Ren,Xuan Zhou,Yajie Qin

doi:10.1007/s41019-019-00114-z

Yunhong Ji, Yunpeng Chai + Show 3 more

Open Access

https://doi.org/10.1007/s41019-019-00114-z

Copy DOI

Abstract

Intra-query fault tolerance has increasingly been a concern for online analytical processing, as more and more enterprises migrate data analytical systems from mainframes to commodity computers. Most massive parallel processing (MPP) databases do not support intra-query fault tolerance. They may suffer from prolonged query latency when running on unreliable commodity clusters. While SQL-on-Hadoop systems can utilize the fault tolerance support of low-level frameworks, such as MapReduce and Spark, their cost-effectiveness is not always acceptable. In this paper, we propose a smart intra-query fault tolerance (SIFT) mechanism for MPP databases. SIFT achieves fault tolerance by performing checkpointing, i.e., materializing intermediate results of selected operators. Different from existing approaches, SIFT aims at promoting query success rate within a given time. To achieve its goal, it needs to: (1) minimize query rerunning time after encountering failures and (2) introduce as less checkpointing overhead as possible. To evaluate SIFT in real-world MPP database systems, we implemented it in Greenplum. The experimental results indicate that it can improve success rate of query processing effectively, especially when working with unreliable hardware.

Highlights

Massive parallel processing (MPP) databases are popular data platforms for enterprise and scientific data analysis
We propose a smart intra-query fault tolerance (SIFT) mechanism for MPP databases
We show how SIFT enables Greenplum to achieve a certain degree of intra-query fault tolerance while preserving its performance in query processing

Summary

Introduction

Massive parallel processing (MPP) databases are popular data platforms for enterprise and scientific data analysis. Fault tolerance of query processing has become increasingly important to MPP databases. Commodity clusters are much less reliable than mainframes, such that databases have to deal with system failures proactively. We provide an overview about the architecture of MPP database Typical MPP databases usually adopt a shared-nothing architecture [2], composed of one master node and n slave nodes. The master node is responsible for interacting with clients, managing the whole cluster and coordinating the query processing. Each of the n salve nodes is responsible for storing a partition of the data and performing query processing on its partition.

Objectives

Methods

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Data Science and Engineering	Publication Date: Dec 19, 2019
Citations: 12	License type: open-access

R Discovery Prime

R Discovery Prime

Smart Intra-query Fault Tolerance for Massive Parallel Processing Databases

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering

Lead the way for us

Similar Papers

An approach to architecture-based fault tolerance evaluation with fault propagation
Shaoguang Shu ... Yichen Wang
-
Shaoguang Shu, et. al.Shaoguang Shu ... Yichen Wang
01 Oct 2015
01 Oct 2015

FT-PBLAS: PBLAS-Based Fault-Tolerant Linear Algebra Computation on High-performance Computing Systems
Yanchao Zhu ... Guozhen Zhang
IEEE Access | VOL. 8
Yanchao Zhu, et. al.Yanchao Zhu ... Guozhen Zhang
01 Jan 2020
IEEE Access | VOL. 8

Fault detection and tolerance mechanisms for future 1000 core systems
Bernhard Fechner ... Theo Ungerer
-
Bernhard Fechner, et. al.Bernhard Fechner ... Theo Ungerer
01 Jul 2013
Fault detection and tolerance mechanisms for future 1000 core systems
Bernhard Fechner ... Theo Ungerer

The Evolution of the Massively Parallel Processing Database in Support of Visual Analytics
Ian A Willson
Information Resources Management Journal | VOL. 24
Ian A WillsonIan A Willson
01 Oct 2011
Information Resources Management Journal | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Smart Intra-query Fault Tolerance for Massive Parallel Processing Databases

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Data Science and Engineering