A new framework based on features modeling and ensemble learning to predict query performance.

Mohamed Zaghloul, Amr Ali-Eldin, Mofreh Salem, Usman Qamar

doi:10.1371/journal.pone.0258439

Abstract

A query optimizer attempts to predict a performance metric based on elapsed time. In theory, this imposes significant overhead on the core engine, which must supply the necessary query optimization statistics. Machine learning is increasingly used to improve query performance by incorporating regression models. To predict the response time of a query, most query performance approaches rely on DBMS optimizer statistics and the estimated cost of each operator in the query execution plan, which also focuses on resource utilization (CPU, I/O). Modeling query features is thus a critical step in developing a robust query performance prediction model. In this paper, we propose a new framework based on query feature modeling and ensemble learning to predict query performance, and we use this framework as a query performance predictor simulator to optimize the query features that influence query performance. For query feature modeling, we propose five dimensions: syntax, hardware, software, data architecture, and historical performance logs. These features are used to build the training datasets for the performance prediction model, which employs ensemble learning. Ensemble learning allows the query performance prediction problem to be addressed while dealing with missing values and handling overfitting via regularization. The experimental section describes how the proposed framework is applied. The training dataset in this paper is made up of performance data logs from various real-world environments. The outcomes were compared to show the difference between the actual performance and the performance predicted by the proposed model. Empirical work shows the effectiveness of the proposed approach compared to related work.
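To make the five feature dimensions concrete, the following is a minimal sketch of how one training row might be assembled. Only the dimension names come from the abstract; the class `QueryFeatures`, the method `to_row`, and every individual feature name (such as `join_count` or `cpu_cores`) are hypothetical placeholders. Values that were not collected are encoded as NaN, on the assumption that the downstream ensemble model tolerates missing values.

```python
import math
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Dimension names follow the abstract; all feature names are hypothetical.
DIMENSIONS = ("syntax", "hardware", "software", "data_architecture", "history")


@dataclass
class QueryFeatures:
    """Features of one query, grouped by the five modeling dimensions."""
    syntax: Dict[str, Optional[float]] = field(default_factory=dict)
    hardware: Dict[str, Optional[float]] = field(default_factory=dict)
    software: Dict[str, Optional[float]] = field(default_factory=dict)
    data_architecture: Dict[str, Optional[float]] = field(default_factory=dict)
    history: Dict[str, Optional[float]] = field(default_factory=dict)

    def to_row(self, columns: List[str]) -> List[float]:
        """Flatten to a fixed column order; absent or None values become NaN."""
        flat: Dict[str, Optional[float]] = {}
        for dim in DIMENSIONS:
            for name, value in getattr(self, dim).items():
                flat[f"{dim}.{name}"] = value
        return [math.nan if flat.get(c) is None else float(flat[c]) for c in columns]


# Illustrative column layout and one query with no historical log entry.
columns = ["syntax.join_count", "hardware.cpu_cores", "history.avg_elapsed_ms"]
query = QueryFeatures(syntax={"join_count": 2}, hardware={"cpu_cores": 8})
row = query.to_row(columns)
```

Here `row` holds the two known values followed by NaN for the missing historical feature, so every query maps to a vector of the same length regardless of which dimensions were observed.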

Highlights

A query optimizer attempts to make a comparable performance estimate.
Model parameters: the type of model to run at each iteration (gbtree for tree-based models, gblinear for linear models); the learning rate used to weight each model; the maximum depth of each tree; gamma, a pseudo-regularization parameter; the fraction of observations to be sampled for each tree; the number of features used in each tree; reg:squarederror for linear regression; and rmse, the root mean square error.
This paper proposed a framework for predicting query performance.
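The parameter descriptions in the highlights resemble the options of the XGBoost library (`gbtree`, `gblinear`, `reg:squarederror` and `rmse` are XGBoost identifiers), although the text here does not name the library. Under that assumption, the sketch below maps each description onto the corresponding XGBoost parameter name. The values shown are XGBoost's documented defaults, not settings reported by the paper, and `rmse` is a plain-Python helper written for illustration.

```python
import math
from typing import Dict, Sequence, Union

# Assumed mapping onto XGBoost parameter names; values are library defaults,
# not the configuration used in the paper.
params: Dict[str, Union[str, float, int]] = {
    "booster": "gbtree",              # model type: "gbtree" or "gblinear"
    "eta": 0.3,                       # learning rate weighting each new model
    "max_depth": 6,                   # maximum depth of each tree
    "gamma": 0.0,                     # minimum loss reduction to split (pseudo-regularization)
    "subsample": 1.0,                 # fraction of observations sampled per tree
    "colsample_bytree": 1.0,          # fraction of features sampled per tree
    "objective": "reg:squarederror",  # regression with squared-error loss
    "eval_metric": "rmse",            # root mean square error
}


def rmse(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Root mean square error between actual and predicted values."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


# With the xgboost package installed, this dict would be passed as
# xgboost.train(params, dtrain), where dtrain is an xgboost.DMatrix.
error = rmse([1.0, 2.0], [1.0, 4.0])
```

Note that XGBoost's `colsample_bytree` is a fraction of features rather than an absolute count, so it only approximates the "number of features used in each tree" wording above.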

Summary

Objectives

This paper proposes a framework for predicting query performance. The objective of this study is to show how to extract features from hardware, software, SQL syntax, and data architecture.
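As an illustration of what extracting features from SQL syntax could look like, the sketch below counts a few coarse syntactic properties of a statement with regular expressions. The function `syntax_features` and the chosen features are assumptions made for this example, not the feature set defined in the paper. Keyword matching of this kind is naive: it would also count keywords that appear inside string literals or comments.

```python
import re
from typing import Dict


def syntax_features(sql: str) -> Dict[str, int]:
    """Count coarse syntactic properties of a SQL statement (naive keyword scan)."""
    text = sql.upper()

    def count(pattern: str) -> int:
        return len(re.findall(pattern, text))

    return {
        "join_count": count(r"\bJOIN\b"),
        "select_count": count(r"\bSELECT\b"),  # more than 1 suggests subqueries
        "predicate_count": count(r"\bWHERE\b") + count(r"\b(?:AND|OR)\b"),
        "has_group_by": int(re.search(r"\bGROUP\s+BY\b", text) is not None),
        "has_order_by": int(re.search(r"\bORDER\s+BY\b", text) is not None),
    }


# Hypothetical query used only to exercise the function.
sql = (
    "SELECT c.name, SUM(o.total) FROM customers c "
    "JOIN orders o ON o.customer_id = c.id "
    "WHERE o.total > 100 AND c.active = 1 GROUP BY c.name"
)
features = syntax_features(sql)
```

A production extractor would more likely walk a parsed query tree or the execution plan, but the resulting dictionary has the same shape: one numeric value per named feature.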

Journal: PloS one
Publication Date: Oct 18, 2021
Citations: 2
License type: CC BY 4.0