Beyond simple aggregates

Zhewei Wei, Ke Yi

doi:10.1145/1989284.1989299

Abstract

Database queries can be broadly classified into two categories: reporting queries and aggregation queries. The former retrieves a collection of records from the database that match the query's conditions, while the latter returns an aggregate, such as count, sum, average, or max (min), of a particular attribute of these records. Aggregation queries are especially useful in business intelligence and data analysis applications where users are interested not in the actual records, but some statistics of them. They can also be executed much more efficiently than reporting queries, by embedding properly precomputed aggregates into an index.

However, reporting and aggregation queries provide only two extremes for exploring the data. Data analysts often need more insight into the data distribution than what those simple aggregates provide, and yet certainly do not want the sheer volume of data returned by reporting queries. In this paper, we design indexing techniques that allow for extracting a statistical summary of all the records in the query. The summaries we support include frequent items, quantiles, various sketches, and wavelets, all of which are of central importance in massive data analysis. Our indexes require linear space and extract a summary with the optimal or near-optimal query cost.
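The remark that aggregation queries can be answered efficiently by embedding precomputed aggregates into an index can be illustrated with a minimal sketch. The class name `AggregateIndex`, its in-memory array layout, and the sample records are assumptions made for illustration; this is an ordinary segment tree, not the authors' construction, and it covers only the simple aggregates (count, sum, max), not the summary queries the paper targets.

```python
from bisect import bisect_left, bisect_right


class AggregateIndex:
    """Hypothetical static index over (key, value) records.

    Every internal node stores the precomputed (count, sum, max) of the
    records below it, so a range aggregate touches O(log n) nodes
    instead of every matching record.
    """

    def __init__(self, records):
        # Sort by key so that a key range maps to a contiguous run of leaves.
        recs = sorted(records)
        self.keys = [k for k, _ in recs]
        self.n = len(recs)
        self.tree = [(0, 0, float("-inf"))] * (2 * self.n)
        # Leaves live at positions n .. 2n-1.
        for i, (_, v) in enumerate(recs):
            self.tree[self.n + i] = (1, v, v)
        # Internal nodes combine the aggregates of their two children.
        for i in range(self.n - 1, 0, -1):
            self.tree[i] = self._merge(self.tree[2 * i], self.tree[2 * i + 1])

    @staticmethod
    def _merge(a, b):
        # count and sum add up; max takes the larger of the two.
        return (a[0] + b[0], a[1] + b[1], max(a[2], b[2]))

    def query(self, lo, hi):
        """Return (count, sum, max) over records with lo <= key <= hi."""
        left = bisect_left(self.keys, lo) + self.n
        right = bisect_right(self.keys, hi) + self.n
        acc = (0, 0, float("-inf"))
        # Walk up the tree, picking up only the nodes that cover the range.
        while left < right:
            if left & 1:
                acc = self._merge(acc, self.tree[left])
                left += 1
            if right & 1:
                right -= 1
                acc = self._merge(acc, self.tree[right])
            left //= 2
            right //= 2
        return acc


# Sample data (made up for this sketch): (key, value) pairs.
records = [(5, 10), (1, 2), (3, 4), (9, 1), (7, 8)]
index = AggregateIndex(records)
count, total, largest = index.query(3, 7)  # keys 3, 5 and 7 match
```

A reporting query over the same range would have to return all matching records, whereas the aggregate is assembled from a logarithmic number of precomputed node values. Extracting a richer summary from such a structure, with optimal or near-optimal cost, is the problem the paper addresses.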
