Sequential sampling procedures for query size estimation

Peter J Haas,Arun N Swami

doi:10.1145/141484.130335

Abstract

We provide a procedure, based on random sampling, for estimation of the size of a query result. The procedure is sequential in that sampling terminates after a random number of steps according to a stopping rule that depends upon the observations obtained so far. Enough observations are obtained so that, with a pre-specified probability, the estimate differs from the true size of the query result by no more than a prespecified amount. Unlike previous sequential estimation procedures for queries, our procedure is asymptotically efficient and requires no ad hoc pilot sample or a a priori assumptions about data characteristics. In addition to establishing the asymptotic properties of the estimation procedure, we provide techniques for reducing undercoverage at small sample sizes and show that the sampling cost of the procedure can be reduced through stratified sampling techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sequential sampling procedures for query size estimation

Abstract

Talk to us

Similar Papers

More From: ACM SIGMOD Record

Lead the way for us

Journal: ACM SIGMOD Record	Publication Date: Jun 1, 1992
Citations: 36

Similar Papers

Sequential sampling procedures for query size estimation
Peter J Haas ... Arun N Swami
-
Peter J Haas, et. al.Peter J Haas ... Arun N Swami
01 Jun 1992
01 Jun 1992

Sequential Estimation of the Mean of a Log-Normal Distribution Having a Prescribed Proportional Closeness
S Zacks
The Annals of Mathematical Statistics | VOL. 37
S ZacksS Zacks
01 Dec 1966
The Annals of Mathematical Statistics | VOL. 37

Sequential prevalence estimation with pooling and continuous test outcomes.
Ngoc T Nguyen ... Hrayer Aprahamian
Statistics in Medicine | VOL. 37
Ngoc T Nguyen, et. al.Ngoc T Nguyen ... Hrayer Aprahamian
23 Apr 2018
Statistics in Medicine | VOL. 37

Distributions of stopping times in some sequential estimation procedures
Alicja Jokiel-Rokita ... Ryszard Magiera
Metrika | VOL. 77
Alicja Jokiel-Rokita, et. al.Alicja Jokiel-Rokita ... Ryszard Magiera
17 Aug 2013
Metrika | VOL. 77

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sequential sampling procedures for query size estimation

Abstract

Talk to us

Similar Papers

More From: ACM SIGMOD Record