LPLM: A Neural Language Model for Cardinality Estimation of LIKE-Queries

Mehmet Aytimur,Michael Grossniklaus,Theodoros Chondrogiannis,Silvan Reiner,Leonard Wörteler

doi:10.1145/3639309

Abstract

Cardinality estimation is an important step in cost-based database query optimization. The accuracy of the estimates directly affects the ability of an optimizer to identify the most efficient query execution plan correctly. In this paper, we study cardinality estimation of LIKE-queries, i.e., queries that use the LIKE-operator to match a pattern with wildcards against string-valued attributes. While both traditional and machine-learning-based approaches have been proposed to tackle this problem, we argue that they all suffer from drawbacks. Most importantly, many state-of-the-art approaches are not designed for patterns that contain wildcards in-between characters. Based on past research on neural language models, we introduce the LIKE-Pattern Language Model (LPLM) that uses a new language and a novel probability distribution function to capture the semantics of general LIKE-patterns. We also propose a method to generate training data for our model. We demonstrate that our method outperforms state-of-the-art approaches in terms of precision (Q-error), while offering comparable runtime performance and memory requirements.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LPLM: A Neural Language Model for Cardinality Estimation of LIKE-Queries

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Management of Data

Lead the way for us

Journal: Proceedings of the ACM on Management of Data	Publication Date: Mar 12, 2024
License type: cc-by

Similar Papers

Efficiently adapting graphical models for selectivity estimation
Kostas Tzoumas ... Christian S Jensen
The VLDB Journal | VOL. 22
Kostas Tzoumas, et. al.Kostas Tzoumas ... Christian S Jensen
07 Nov 2012
The VLDB Journal | VOL. 22

A learning optimizer for a federated database management system
Stephan Ewen ... Volker Markl
Informatik - Forschung und Entwicklung | VOL. 20
Stephan Ewen, et. al.Stephan Ewen ... Volker Markl
21 Sep 2005
Informatik - Forschung und Entwicklung | VOL. 20

A Multiple Continuous Query Optimization Method Based on Query Execution Pattern Analysis
Yousuke Watanabe ... Hiroyuki Kitagawa
-
Yousuke Watanabe, et. al.Yousuke Watanabe ... Hiroyuki Kitagawa
01 Jan 2004
01 Jan 2004

Speeding Up End-to-end Query Execution via Learning-based Progressive Cardinality Estimation
Fang Wang ... Man Lung Yiu
Proceedings of the ACM on Management of Data | VOL. 1
Fang Wang, et. al.Fang Wang ... Man Lung Yiu
26 May 2023
Proceedings of the ACM on Management of Data | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LPLM: A Neural Language Model for Cardinality Estimation of LIKE-Queries

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Management of Data