Scaling the Construction of Wavelet Synopses for Maximum Error Metrics

Ioannis Mytilinis,Nectarios Koziris,Dimitrios Tsoumakos

doi:10.1109/tkde.2018.2867185

Abstract

Modern analytics involve computations over enormous numbers of data records. The volume of data and the stringent response-time requirements place increasing emphasis on the efficiency of approximate query processing. A major challenge over the past years has been the construction of synopses that provide a deterministic quality guarantee, often expressed in terms of a maximum error metric. By approximating sharp discontinuities, wavelet decomposition has proved to be a very effective tool for data reduction. However, existing wavelet thresholding schemes that minimize maximum error metrics are constrained with impractical complexities for large datasets. Furthermore, they cannot efficiently handle the multi-dimensional version of the problem. In order to provide a practical solution, we develop parallel algorithms that take advantage of key-properties of the wavelet decomposition and allocate tasks to multiple workers. To that end, we present (i) a general framework for the parallelization of existing dynamic programming algorithms, (ii) a parallel version of one such DP algorithm, and (iii) two highly efficient distributed greedy algorithms that can deal with data of arbitrary dimensionality. Our extensive experiments on both real and synthetic datasets over Hadoop show that the proposed algorithms achieve linear scalability and superior running-time performance compared to their centralized counterparts.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Scaling the Construction of Wavelet Synopses for Maximum Error Metrics

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Sep 1, 2019
Citations: 36

Similar Papers

Distributed Wavelet Thresholding for Maximum Error Metrics
Ioannis Mytilinis ... Dimitrios Tsoumakos
-
Ioannis Mytilinis, et. al.Ioannis Mytilinis ... Dimitrios Tsoumakos
14 Jun 2016
14 Jun 2016

Deterministic wavelet thresholding for maximum-error metrics
Minos Garofalakis ... Amit Kumar
-
Minos Garofalakis, et. al.Minos Garofalakis ... Amit Kumar
14 Jun 2004
14 Jun 2004

Hierarchical synopses with optimal error guarantees
Panagiotis Karras ... Nikos Mamoulis
ACM Transactions on Database Systems | VOL. 33
Panagiotis Karras, et. al.Panagiotis Karras ... Nikos Mamoulis
01 Aug 2008
ACM Transactions on Database Systems | VOL. 33

Computing Unrestricted Synopses Under Maximum Error Bound
Chaoyi Pang ... Anthony Maeder
Algorithmica | VOL. 65
Chaoyi Pang, et. al.Chaoyi Pang ... Anthony Maeder
05 Oct 2011
Algorithmica | VOL. 65

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scaling the Construction of Wavelet Synopses for Maximum Error Metrics

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering