Sub-optimal Join Order Identification with L1-error

Yesdaulet Izenov, Brian Tsan, Asoke Datta, Florin Rusu

doi:10.1145/3639272

Abstract

Q-error, the standard metric for quantifying the error of individual cardinality estimates, has been widely adopted as a surrogate for query plan optimality in recent work on learning-based cardinality estimation. However, the only result connecting Q-error with plan optimality is an upper bound on the cost of the worst possible query plan computed from a set of cardinality estimates; there is no connection between Q-error and the real plans generated by standard query optimizers. Therefore, to identify sub-optimal query plans, we propose a learning-based method whose main feature is a novel measure called L1-error. Like Q-error, L1-error requires complete knowledge of the true cardinalities and the estimates for all the sub-plans of a query plan. Unlike Q-error, which considers the estimates independently, L1-error is defined as a permutation distance between the true cardinalities and the estimates for all the sub-plans having the same number of joins. Moreover, L1-error takes into account errors relative to the magnitude of their cardinalities and gives larger weight to small multi-way joins. Our experimental results confirm that, when L1-error is integrated into a standard decision tree classifier, it leads to the accurate identification of sub-optimal plans across four different benchmarks. This accuracy can be further improved by combining L1-error with Q-error into a composite feature that can be computed without overhead from the same data.
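To make the contrast between a per-estimate error and a permutation distance concrete, the following is a minimal sketch. It is not the paper's L1-error, whose exact definition the abstract does not give. The Q-error function uses the standard definition, max(estimate/true, true/estimate). The permutation distance is an unweighted Spearman footrule over the rank orders of the two cardinality lists, used here only as a stand-in; it omits the magnitude-relative weighting and the emphasis on small multi-way joins that the abstract mentions. All function names and sample cardinalities are hypothetical.

```python
from typing import List, Sequence


def q_error(true_card: float, est_card: float) -> float:
    """Standard Q-error: max(est/true, true/est); always >= 1."""
    # Clamp to 1 to avoid division by zero on empty results (a common convention).
    t = max(true_card, 1.0)
    e = max(est_card, 1.0)
    return max(t / e, e / t)


def ranks(values: Sequence[float]) -> List[int]:
    """Rank of each element (0 = smallest); ties are broken by position."""
    order = sorted(range(len(values)), key=lambda i: (values[i], i))
    rank = [0] * len(values)
    for pos, idx in enumerate(order):
        rank[idx] = pos
    return rank


def footrule_distance(true_cards: Sequence[float], est_cards: Sequence[float]) -> int:
    """Spearman footrule: sum of |rank under true - rank under estimate|.

    ASSUMPTION: an unweighted stand-in for a permutation distance,
    not the weighted L1-error defined in the paper.
    """
    rank_true = ranks(true_cards)
    rank_est = ranks(est_cards)
    return sum(abs(a - b) for a, b in zip(rank_true, rank_est))


# Hypothetical cardinalities of three sub-plans with the same number of joins.
true_cards = [100, 2000, 50000]

# Every estimate is 10x too large, but the relative order is preserved.
uniform_over = [1000, 20000, 500000]

# Only one estimate is badly off, and it swaps the order of two sub-plans.
order_swapped = [3000, 1500, 40000]

for name, est in [("uniform_over", uniform_over), ("order_swapped", order_swapped)]:
    q_errors = [q_error(t, e) for t, e in zip(true_cards, est)]
    print(name, "max Q-error:", max(q_errors), "footrule:", footrule_distance(true_cards, est))
```

In this toy setting, the uniformly overestimated sub-plans all have a Q-error of 10 yet keep their relative order, so the permutation distance is 0. The second set of estimates swaps the order of two sub-plans, which the permutation distance registers directly.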

Journal: Proceedings of the ACM on Management of Data
Publication Date: Mar 12, 2024
License type: cc-by

Similar Papers

Cost-Guided Cardinality Estimation: Focus Where it Matters
Parimarjan Negi ... Ryan Marcus
01 Apr 2020

Flow-loss
Parimarjan Negi ... Hongzi Mao
Proceedings of the VLDB Endowment | VOL. 14
01 Jul 2021

Smooth Scan: robust access path selection without cardinality estimation
Renata Borovica-Gajic ... Anastasia Ailamaki
The VLDB Journal | VOL. 27
29 May 2018

ROSIE: Runtime Optimization of SPARQL Queries over RDF Using Incremental Evaluation
Lei Gai ... Tengjiao Wang
01 Jan 2018
