Presto's History-Based Query Optimizer

Pranjal Shankhdhar,Feilong Liu,Jay Narale,James Sun,Rebecca Schlussel,Lyublena Antova

doi:10.14778/3685800.3685828

Abstract

An important feature of modern query optimizers is the ability to produce a query plan that is optimal for the underlying data set. This requires the ability to estimate cardinalities and computational costs of intermediate query plan nodes, which is highly dependent on both the query shape and the underlying data distribution. Traditional methods include collecting statistics on base tables and implementing cardinality and computational cost derivation inside the optimizer, which is error-prone for complex query shapes. This paper presents Presto's novel history-based optimization framework (HBO), which collects execution histories and uses them to optimize similar queries in the future. The framework produces accurate estimates for complex query shapes in a lightweight, automated manner, and adapts automatically to changes in underlying data distributions. We present the design and implementation of the HBO framework and provide details on its use in various optimization rules, as well as details on implementing the statistics store on top of a Redis key-value store. We also present the results of running HBO in production in two large data infrastructure organizations (Meta and Uber).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Presto's History-Based Query Optimizer

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment

Lead the way for us

Similar Papers

Cost-Based Query Optimization via AI Planning
Nathan Robinson ... David Toman
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 28
Nathan Robinson, et. al.Nathan Robinson ... David Toman
21 Jun 2014
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 28

From Massive Parallelization to Quantum Computing: Seven Novel Approaches to Query Optimization

-

01 Jan 2015
01 Jan 2015

Wavelet-based histograms for selectivity estimation
Yossi Matias ... Jeffrey Scott Vitter
ACM SIGMOD Record | VOL. 27
Yossi Matias, et. al.Yossi Matias ... Jeffrey Scott Vitter
01 Jun 1998
ACM SIGMOD Record | VOL. 27

Wavelet-based histograms for selectivity estimation
Yossi Matias ... Jeffrey Scott Vitter
-
Yossi Matias, et. al.Yossi Matias ... Jeffrey Scott Vitter
01 Jun 1998
01 Jun 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Presto's History-Based Query Optimizer

Abstract

Talk to us

Similar Papers

More From: Proceedings of the VLDB Endowment