Performance Prediction for Concurrent Workloads in Distributed Database Systems

Hui Li, Zhenyu Dai, Ming Zhu, Xiaohuan Hou, Menglin Huang, Mei Chen

doi:10.1007/978-3-319-27140-8_43

Abstract

To store and process data at large scale, distributed databases partition data and process it in parallel across the nodes of a cluster. When such a database concurrently executes heterogeneous query workloads, performance prediction is needed. However, running queries in a distributed database incurs substantial network overhead from data transmission between cluster nodes. Hence, in this work, we take network latency into account when predicting concurrent query performance. We propose a linear regression model to estimate the interaction among concurrently executing queries for analytical workloads in a distributed database system. Since network latency and local processing overhead are the two most significant factors in query execution, we analyze query behavior with multivariate regression on both of them at different degrees of concurrency. In addition, we use sampling techniques to obtain various query mixes as the concurrency level increases. The experiments evaluating our prediction model are conducted on a PostgreSQL database cluster with TPC-H as a representative analytical workload. The experimental results demonstrate that our model keeps the relative error of its query latency predictions within 14% on average.
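
As a rough illustration of the kind of model the abstract describes, the following minimal sketch fits a multivariate linear regression of query latency on two predictors, network latency and local processing cost, and then scores it by mean relative error. It is not the authors' implementation: the function names, the two-feature layout, and the observations are all hypothetical and made up for illustration, and the abstract does not say how the features are measured or how the fit is computed. In the setting described, one such model would presumably be fitted per concurrency level.

```python
from typing import List, Sequence


def fit_linear(features: Sequence[Sequence[float]],
               targets: Sequence[float]) -> List[float]:
    """Ordinary least squares with an intercept, via the normal equations.

    Returns [intercept, coef_1, ..., coef_n].
    """
    rows = [[1.0, *f] for f in features]  # prepend the intercept column
    k = len(rows[0])
    # Build the augmented system [X^T X | X^T y].
    aug = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * y for r, y in zip(rows, targets))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("features are collinear; cannot fit")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(k):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][k] / aug[i][i] for i in range(k)]


def predict(coeffs: Sequence[float], feature: Sequence[float]) -> float:
    """Predicted latency for one (network_latency, local_cost) pair."""
    return coeffs[0] + sum(c * x for c, x in zip(coeffs[1:], feature))


def mean_relative_error(actual: Sequence[float],
                        predicted: Sequence[float]) -> float:
    """Average of |actual - predicted| / actual over all samples."""
    return sum(abs(a - p) / a for a, p in zip(actual, predicted)) / len(actual)


# Made-up observations for a single concurrency level (NOT from the paper):
# each feature pair is (network latency, local processing cost), in ms.
observations = [(1.0, 2.0), (2.0, 1.0), (3.0, 5.0), (4.0, 3.0), (5.0, 8.0)]
latencies = [10.0, 10.5, 18.5, 17.5, 27.0]

coeffs = fit_linear(observations, latencies)
predictions = [predict(coeffs, f) for f in observations]
error = mean_relative_error(latencies, predictions)
print(f"coefficients={coeffs}, mean relative error={error:.4f}")
```

The synthetic latencies here are exactly linear in the two features, so the fit is near perfect. Real measurements would be noisy, and the error should be computed on query mixes held out from fitting.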
