Versatile Multidimensional Histograms for Different Data Distributions

Wei Cao, Xiong-Pai Qin, Qiu-Yue Wang, Shan Wang

doi:10.3724/sp.j.1016.2008.01013

Abstract

Traditional multidimensional histograms, which are widely used in RDBMS query optimizers to estimate the cardinality of conjunctive range query predicates, assume that attributes are correlated rather than relying on the superficially plausible attribute value independence (AVI) assumption. However, they do not distinguish between different degrees of correlation among attributes. Based on accurate measurements of data distributions, data correlation coefficients, and value domain density, the authors propose COCA-Hist, a family of multidimensional histograms in which a different optimal histogram is used for each kind of data distribution. They also analyze the worst cases of traditional MHist-2 histograms and find effective ways to alleviate them. The authors conduct experiments comparing the accuracy and performance of COCA-Hist with those of MHist-2, GENHist, and STHoles. The results show that COCA-Hist histograms are superior to MHist-2 in both accuracy and performance, in the average case as well as in the worst case. Under soft functional dependencies, COCA-Hist outperforms GENHist by orders of magnitude in both accuracy and construction time. Under limited space budgets, COCA-Hist is one order of magnitude faster to build than STHoles. Although STHoles exhibits good accuracy when the space budget is sufficient, COCA-Hist achieves somewhat better accuracy than STHoles on average.
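To make the gap between the AVI assumption and a multidimensional histogram concrete, the following is a minimal sketch in Python. It is not COCA-Hist, MHist-2, GENHist, or STHoles: it uses a plain equi-width two-dimensional grid, and the function names (`avi_estimate`, `build_grid_histogram`, `grid_estimate`), the synthetic data, and the bucket width are all assumptions chosen for illustration. The data set is perfectly correlated (y = x), which is the kind of case where AVI tends to underestimate a conjunctive range predicate.

```python
from collections import Counter


def avi_estimate(rows, x_range, y_range):
    """Estimate matching rows by multiplying per-attribute selectivities (AVI)."""
    n = len(rows)
    sel_x = sum(x_range[0] <= x < x_range[1] for x, _ in rows) / n
    sel_y = sum(y_range[0] <= y < y_range[1] for _, y in rows) / n
    return n * sel_x * sel_y


def build_grid_histogram(rows, width):
    """Count rows per square bucket of side `width` (illustrative equi-width grid)."""
    return Counter((x // width, y // width) for x, y in rows)


def overlap(lo, hi, bucket, width):
    """Fraction of one bucket's extent along an axis covered by [lo, hi)."""
    b_lo, b_hi = bucket * width, (bucket + 1) * width
    return max(0, min(hi, b_hi) - max(lo, b_lo)) / width


def grid_estimate(hist, width, x_range, y_range):
    """Estimate matching rows, assuming rows are spread uniformly inside each bucket."""
    total = 0.0
    for (bx, by), count in hist.items():
        total += count * overlap(*x_range, bx, width) * overlap(*y_range, by, width)
    return total


# Hypothetical, perfectly correlated data: 100 rows with y == x.
rows = [(i, i) for i in range(100)]
x_range = y_range = (0, 10)  # predicate: 0 <= x < 10 AND 0 <= y < 10

true_count = sum(
    x_range[0] <= x < x_range[1] and y_range[0] <= y < y_range[1] for x, y in rows
)
avi = avi_estimate(rows, x_range, y_range)
grid = grid_estimate(build_grid_histogram(rows, 10), 10, x_range, y_range)

print("true:", true_count, "AVI:", avi, "2-D grid:", grid)
```

On this data the AVI estimate is about one row against a true count of ten, while the grid recovers the true count because the query happens to align with bucket boundaries. For queries that cut through buckets, the uniform-within-bucket assumption reintroduces error, which is one reason bucket placement matters in practice.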

Journal: Chinese Journal of Computers
