The Multi-Tree Cubing algorithm for computing iceberg cubes

Xing Li,Kamran Karimi,Howard J Hamilton,Liqiang Geng

doi:10.1007/s10844-008-0074-3

Abstract

The computation of data cubes is one of the most expensive operations in on-line analytical processing (OLAP). To improve efficiency, an iceberg cube represents only the cells whose aggregate values are above a given threshold (minimum support). Top-down and bottom-up approaches are used to compute the iceberg cube for a data set, but both have performance limitations. In this paper, a new algorithm, called Multi-Tree Cubing (MTC), is proposed for computing an iceberg cube. The Multi-Tree Cubing algorithm is an integrated top-down and bottom-up approach. Overall control is handled in a top-down manner, so MTC features shared computation. By processing the orderings in the opposite order from the Top-Down Computation algorithm, the MTC algorithm is able to prune attributes. The Bottom Up Computation (BUC) algorithm and its variations also perform pruning by relying on the processing of intermediate partitions. The MTC algorithm, however, prunes without processing such partitions. The MTC algorithm is based on a specialized type of prefix tree data structure, called an Attribute–Partition tree (AP-tree), consisting of attribute and partition nodes. The AP-tree facilitates fast, in-memory sorting and APRIORI-like pruning. We report on five series of experiments, which confirm that MTC is consistently as fast or faster than BUC, while finding the same iceberg cubes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The Multi-Tree Cubing algorithm for computing iceberg cubes

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Information Systems

Lead the way for us

Journal: Journal of Intelligent Information Systems	Publication Date: Oct 30, 2008
Citations: 11

Similar Papers

Scalable distributed data cube computation for large-scale multidimensional data analysis on a Spark cluster
Suan Lee ... Eun Jung Yu
Cluster Computing | VOL. 22
Suan Lee, et. al.Suan Lee ... Eun Jung Yu
01 Feb 2018
Cluster Computing | VOL. 22

MM-Cubing: computing Iceberg cubes by factorizing the lattice space
...
-
, et. al. ...
21 Jun 2004
21 Jun 2004

Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration
Dong Xin ... Benjamin W Wah
Proceedings 2003 VLDB Conference | VOL. -
Dong Xin, et. al.Dong Xin ... Benjamin W Wah
01 Jan 2003
Proceedings 2003 VLDB Conference | VOL. -

Bottom-up computation of sparse and Iceberg CUBE
Kevin Beyer ... Raghu Ramakrishnan
-
Kevin Beyer, et. al.Kevin Beyer ... Raghu Ramakrishnan
01 Jun 1999
01 Jun 1999

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Multi-Tree Cubing algorithm for computing iceberg cubes

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent Information Systems