LAQy: Efficient and Reusable Query Approximations via Lazy Sampling

Viktor Sanca, Anastasia Ailamaki, Periklis Chrysogelos

doi:10.1145/3589319

Abstract

Modern analytical engines rely on Approximate Query Processing (AQP) to provide faster response times than the hardware allows for exact query answering. However, existing AQP methods impose steep performance penalties as workload unpredictability increases. Specifically, offline AQP relies on predictable workloads to create samples that match the queries prior to query execution, reducing query response times when queries match the expected workload. As soon as workload predictability diminishes, existing online AQP methods create query-specific samples with little reuse across queries, producing significantly smaller gains in response times. As a result, existing approaches cannot fully exploit the benefits of sampling under increased unpredictability. We analyze sample creation and propose LAQy, a framework for building, expanding, and merging samples to adapt to changes in workload predicates. We show the main parameters that affect sample creation time and propose lazy sampling to overcome the unpredictability issues that cause fast-but-specialized samples to be query-specific. We evaluate LAQy by implementing it in an in-memory, code-generation-based, scale-up analytical engine to show the adaptivity and practicality of our framework in a modern system. Depending on the degree of sample reuse, the online sampling time under LAQy ranges from practically zero to the full online sampling time, and LAQy achieves speedups from 2.5x to 19.3x in a simulated exploratory workload.
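To make the idea of expanding and merging samples concrete, the following is a minimal sketch and not the paper's algorithm. It assumes uniform reservoir samples taken over disjoint predicate ranges, and merges two of them into a uniform sample of their union using a textbook hypergeometric split. The function names `reservoir_sample` and `merge_samples`, the column values, and the predicate ranges are all hypothetical and chosen only for illustration.

```python
import random


def reservoir_sample(rows, k, rng):
    """Uniform sample of up to k rows (Algorithm R). Returns (sample, rows_seen)."""
    sample = []
    seen = 0
    for row in rows:
        seen += 1
        if len(sample) < k:
            sample.append(row)
        else:
            j = rng.randrange(seen)  # uniform index in [0, seen)
            if j < k:
                sample[j] = row
    return sample, seen


def merge_samples(s1, n1, s2, n2, k, rng):
    """Merge uniform samples of two DISJOINT partitions (sizes n1, n2)
    into a uniform sample of size min(k, n1 + n2) over their union.
    Assumes len(s1) == min(k, n1) and len(s2) == min(k, n2)."""
    k = min(k, n1 + n2)
    take1, r1, r2 = 0, n1, n2
    # Simulate drawing k rows without replacement from the union and
    # count how many would come from partition 1 (a hypergeometric draw).
    for _ in range(k):
        if rng.randrange(r1 + r2) < r1:
            take1 += 1
            r1 -= 1
        else:
            r2 -= 1
    return rng.sample(s1, take1) + rng.sample(s2, k - take1)


if __name__ == "__main__":
    rng = random.Random(42)
    table = list(range(1000))  # hypothetical column values

    # An earlier query with predicate x < 500 already produced this sample.
    cached, n_cached = reservoir_sample((x for x in table if x < 500), 50, rng)

    # A later query needs x < 1000: sample only the missing range, then merge.
    delta, n_delta = reservoir_sample((x for x in table if x >= 500), 50, rng)
    merged = merge_samples(cached, n_cached, delta, n_delta, 50, rng)
    print(len(merged), "rows sampled over", n_cached + n_delta, "input rows")
```

In this sketch only the rows outside the cached predicate range are scanned for the second query, which is the kind of reuse the abstract describes; how LAQy itself tracks predicates and decides what to reuse is not specified here.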

Journal: Proceedings of the ACM on Management of Data
Publication Date: Jun 13, 2023
Citations: 1
