New sampling-based summary statistics for improving approximate query answers

Phillip B Gibbons,Yossi Matias

doi:10.1145/276305.276334

Abstract

In large data recording and warehousing environments, it is often advantageous to provide fast, approximate answers to queries, whenever possible. Before DBMSs providing highly-accurate approximate answers can become a reality, many new techniques for summarizing data and for estimating answers from summarized data must be developed. This paper introduces two new sampling-based summary statistics, concise samples and counting samples, and presents new techniques for their fast incremental maintenance regardless of the data distribution. We quantify their advantages over standard sample views in terms of the number of additional sample points for the same view size, and hence in providing more accurate query answers. Finally, we consider their application to providing fast approximate answers to hot list queries. Our algorithms maintain their accuracy in the presence of ongoing insertions to the data warehouse.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

New sampling-based summary statistics for improving approximate query answers

Abstract

Talk to us

Similar Papers

More From: ACM SIGMOD Record

Lead the way for us

Journal: ACM SIGMOD Record	Publication Date: Jun 1, 1998
Citations: 61

Similar Papers

New sampling-based summary statistics for improving approximate query answers
Phillip B Gibbons ... Yossi Matias
-
Phillip B Gibbons, et. al.Phillip B Gibbons ... Yossi Matias
01 Jun 1998
01 Jun 1998

Approximate XML query answers
Neoklis Polyzotis ... Minos Garofalakis
-
Neoklis Polyzotis, et. al.Neoklis Polyzotis ... Minos Garofalakis
13 Jun 2004
13 Jun 2004

Probabilistic wavelet synopses
Minos Garofalakis ... Phillip B Gibbons
ACM Transactions on Database Systems | VOL. 29
Minos Garofalakis, et. al.Minos Garofalakis ... Phillip B Gibbons
01 Mar 2004
ACM Transactions on Database Systems | VOL. 29

CoXML: Cooperative XML Query Answering?
Wesley W Chu ... Shaorong Liu
-
Wesley W Chu, et. al.Wesley W Chu ... Shaorong Liu
14 Dec 2007
14 Dec 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

New sampling-based summary statistics for improving approximate query answers

Abstract

Talk to us

Similar Papers

More From: ACM SIGMOD Record