MM-Cubing: computing Iceberg cubes by factorizing the lattice space

Shu Zheng, Jiawei Han, Xin Luna Dong

doi:10.1109/ssdbm.2004.53

Abstract

The data cube and iceberg cube computation problem has been studied by many researchers. Three major approaches have been developed: (1) top-down computation, represented by MultiWay array aggregation (Zhao et al., 1997), which utilizes shared computation and performs well on dense data sets; (2) bottom-up computation, represented by BUC (Beyer and Ramakrishnan, 1999), which takes advantage of Apriori pruning and performs well on sparse data sets; and (3) integrated top-down and bottom-up computation, represented by Star-Cubing (Xin et al., 2003), which takes advantage of both and has high performance in most cases. However, the performance of Star-Cubing degrades on very sparse data sets due to the additional cost introduced by the tree structure. None of the three approaches achieves uniformly high performance on all kinds of data sets. In this paper, we present a new approach that computes iceberg cubes by factorizing the lattice space according to the frequency of values. Unlike the previous dimension-based approaches, which do not recognize the importance of data distribution, this approach partitions the cube lattice into one dense subspace and several sparse subspaces. Based on this approach, a new method called MM-Cubing has been developed. MM-Cubing is highly adaptive to dense, sparse, or skewed data sets. Our performance study shows that MM-Cubing is efficient and achieves high performance over all kinds of data distributions.
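
To make the terms in the abstract concrete, the following is a minimal sketch of what an iceberg cube is and of how value frequencies relate to it. It is not the MM-Cubing algorithm, whose details the abstract does not give. The function names `iceberg_cube` and `split_values_by_frequency`, the `*` placeholder for an aggregated dimension, and the sample rows are all assumptions made for illustration. The first function counts every cell of every group-by naively and keeps cells whose count reaches the threshold `min_sup`; the second separates, per dimension, values that occur at least `min_sup` times from those that do not.

```python
from collections import Counter
from itertools import combinations

STAR = "*"  # placeholder for an aggregated ("all") dimension; an assumption of this sketch


def iceberg_cube(rows, min_sup):
    """Naive iceberg cube: count every cell of every group-by,
    then keep only cells whose count is at least min_sup."""
    if not rows:
        return {}
    n_dims = len(rows[0])
    counts = Counter()
    # Enumerate every subset of dimensions (every cuboid in the lattice).
    for k in range(n_dims + 1):
        for dims in combinations(range(n_dims), k):
            kept = set(dims)
            for row in rows:
                cell = tuple(row[d] if d in kept else STAR for d in range(n_dims))
                counts[cell] += 1
    return {cell: c for cell, c in counts.items() if c >= min_sup}


def split_values_by_frequency(rows, min_sup):
    """Per dimension, separate values occurring at least min_sup times
    from values occurring fewer times (hypothetical helper)."""
    n_dims = len(rows[0]) if rows else 0
    frequent, infrequent = [], []
    for d in range(n_dims):
        freq = Counter(row[d] for row in rows)
        frequent.append({v for v, c in freq.items() if c >= min_sup})
        infrequent.append({v for v, c in freq.items() if c < min_sup})
    return frequent, infrequent


# Made-up sample data: four tuples over three dimensions.
rows = [
    ("a1", "b1", "c1"),
    ("a1", "b1", "c2"),
    ("a1", "b2", "c1"),
    ("a2", "b1", "c1"),
]
cube = iceberg_cube(rows, min_sup=2)
frequent, infrequent = split_values_by_frequency(rows, min_sup=2)
```

With this sample, `cube` holds seven cells, and none of them contains a value from `infrequent`. This is the Apriori property that the bottom-up methods exploit: a cell cannot reach `min_sup` if any single value in it occurs fewer than `min_sup` times. The naive enumeration is exponential in the number of dimensions and serves only to define the output that the methods named in the abstract compute more efficiently.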
