Compact Data Structure Research Articles

Multidimensional data analysis has attracted a lot of research efforts during past years. One of the aspects that has been addressed so far is that to allow users to analyze their data from different perspectives, each of which corresponds to a selected subset of dimensions. To optimize these analysis queries, precomputation, and materialization, are among most studied solutions. In the context of skyline analysis, the skycube structure has been proposed as an optimization structure to allow users to ask for the non dominated records with respect to every selected dimensions set. More precisely, given a set of dimensions D={D1,…,Dd} and a relation T(id,D), the Skycube of T is the set of all skylines obtained by considering each of the subsets of D (subspaces). To make the Skycube practically useful, two lines of research have been pursued so far: the first one aims to propose efficient algorithms for computing it. Note that the number of these skylines is exponential w.r.t. |D|. Hence, both execution time and storage space make these solutions struggling with even moderately large datasets, say |D| larger than 10 and the number of tuples greater than 106 . This motivated the second line of researches which propose Skycube summarization techniques to reduce both time and space consumption. Both lines of research, store the whole or a summary of the following information: “for every tuple t, keep track of the dimensions subsets X (subspaces) where t belongs to the respective skyline”. In this paper, we consider the complementary statement, i.e., “for every tuple t, we store a compact data structure encoding the subspaces X with respect to which, tis dominated”. This is what we call the negative skycube. Despite the apparent equivalence between the two statements (dominated vs not dominated), our analysis and extensive experiments show that these two points of view do not lead to the same behavior of the related algorithms. More specifically, our proposal shows that: (i) the negative summary can be obtained much faster than state of the art techniques for positive summaries, (ii) in general, it consumes less memory space, (iii) skyline queries evaluation using this summary is much faster, (iv) the positive Skycube can be obtained more rapidly than state of the art algorithms especially designed for this purpose, and (v) it is highly effective with respect to insertions and deletions.

Read full abstract

A filtration over a simplicial complex K is an ordering of the simplices of K such that all prefixes in the ordering are subcomplexes of K . Filtrations are at the core of Persistent Homology, a major tool in Topological Data Analysis. To represent the filtration of a simplicial complex, the entire filtration can be appended to any data structure that explicitly stores all the simplices of the complex such as the Hasse diagram or the recently introduced Simplex Tree [Algorithmica’14]. However, with the popularity of various computational methods that need to handle simplicial complexes, and with the rapidly increasing size of the complexes, the task of finding a compact data structure that can still support efficient queries is of great interest. This direction has been recently pursued for the case of maintaining simplicial complexes. For instance, Boissonnat et al. [Algorithmica’17] considered storing the simplices that are maximal with respect to inclusion and Attali et al. [IJCGA’12] considered storing the simplices that block the expansion of the complex. Nevertheless, so far there has been no data structure that compactly stores the filtration of a simplicial complex, while also allowing the efficient implementation of basic operations on the complex. In this article, we propose a new data structure called the Critical Simplex Diagram (CSD), which is a variant of the Simplex Array List [Algorithmica’17]. Our data structure allows one to store in a compact way the filtration of a simplicial complex and allows for the efficient implementation of a large range of basic operations. Moreover, we prove that our data structure is essentially optimal with respect to the requisite storage space. Finally, we show that the CSD representation admits fast construction algorithms for Flag complexes and relaxed Delaunay complexes.

Read full abstract

Compact Data Structure Research Articles

Articles published on Compact Data Structure

Photon mapping to accelerate daylight simulation with high-resolution, data-driven fenestration models

Using Predictive and Differential Methods with K2-Raster Compact Data Structure for Hyperspectral Image Lossless Compression

SPATIAL ADJACENCY ANALYSIS OF CITYGML BUILDINGS VIA 3D TOPOLOGICAL DATA STRUCTURE

The negative skycube

Sequence tube maps: making graph genomes intuitive to commuters.

Exact Representations and Geometric Queries for Lattice Structures with Quador Beams

HUOPM: High-Utility Occupancy Pattern Mining.

Energy Consumption in Compact Integer Vectors: A Study Case

Accelerating Sequence Alignments Based on FM-Index Using the Intel KNL Processor.

The Book Review Column

Review of Compact Data Structures - a practical approach by Gonzalo Navarro

Path queries on functions

Compact and efficient representation of general graph databases

A Compact Face-Based Topological Data Structure for Triangle Mesh Representation

Compact Data Structures to Represent and Query Data Warehouses into Main Memory

An Efficient Representation for Filtrations of Simplicial Complexes

Accelerated partial decoding in wavelet trees

GPU-accelerated generation and rendering of multi-level voxel representations of solid models

Using Compressed Suffix-Arrays for a compact representation of temporal-graphs

Constraint-Based Inference in Probabilistic Logic Programs

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Compact Data Structure Research Articles

Articles published on Compact Data Structure

Photon mapping to accelerate daylight simulation with high-resolution, data-driven fenestration models

Using Predictive and Differential Methods with K2-Raster Compact Data Structure for Hyperspectral Image Lossless Compression

SPATIAL ADJACENCY ANALYSIS OF CITYGML BUILDINGS VIA 3D TOPOLOGICAL DATA STRUCTURE

The negative skycube

Sequence tube maps: making graph genomes intuitive to commuters.

Exact Representations and Geometric Queries for Lattice Structures with Quador Beams

HUOPM: High-Utility Occupancy Pattern Mining.

Energy Consumption in Compact Integer Vectors: A Study Case

Accelerating Sequence Alignments Based on FM-Index Using the Intel KNL Processor.

The Book Review Column

Review of Compact Data Structures - a practical approach by Gonzalo Navarro

Path queries on functions

Compact and efficient representation of general graph databases

A Compact Face-Based Topological Data Structure for Triangle Mesh Representation

Compact Data Structures to Represent and Query Data Warehouses into Main Memory

An Efficient Representation for Filtrations of Simplicial Complexes

Accelerated partial decoding in wavelet trees

GPU-accelerated generation and rendering of multi-level voxel representations of solid models

Using Compressed Suffix-Arrays for a compact representation of temporal-graphs

Constraint-Based Inference in Probabilistic Logic Programs