The case for a wide-table approach to manage sparse relational data sets

Eric Chu,Jeffrey Naughton,Jennifer Beckmann

doi:10.1145/1247480.1247571

Abstract

A data set typically has hundreds or even thousands of attributes, but most objects have non-null values for only a small number of these attributes. A popular view about sparse data is that it arises merely as the result of poor schema design. In this paper, we argue that rather than being the result of inept schema design,storing a sparse data set in a single table is the right way to proceed. However, for this to be the case, RDBMSs must provide sparse data management facilities that go beyond the previously studied requirement of storing such data sets efficiently. In particular, an RDBMS must 1) enable users to effectively build ad hoc queries over a very large number of attributes, and 2) support efficient evaluation of these queries over a wide, sparse table. We propose techniques that provide these capabilities, and argue that the single-table approach is a necessary component of self-managing database systems because it frees users from a tedious and potentially ineffective schema-design phase when managing sparse data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The case for a wide-table approach to manage sparse relational data sets

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Using sparse photometric data sets for asteroid lightcurve studies
Brian D Warner ... Alan W Harris
Icarus | VOL. 216
Brian D Warner, et. al.Brian D Warner ... Alan W Harris
20 Oct 2011
Icarus | VOL. 216

Stratigraphic uncertainty in sparse versus rich data sets in a fluvial-deltaic outcrop analog: Ferron Notom delta in the Henry Mountains region, southern Utah
Weiguo Li ... Janok P Bhattacharya
AAPG Bulletin | VOL. 96
Weiguo Li, et. al.Weiguo Li ... Janok P Bhattacharya
01 Mar 2012
AAPG Bulletin | VOL. 96

An effective crosswell seismic traveltime-estimation approach for quasi-continuous reservoir monitoring
Adeyemi Arogunmati ... Jerry M Harris
GEOPHYSICS | VOL. 77
Adeyemi Arogunmati, et. al.Adeyemi Arogunmati ... Jerry M Harris
01 Mar 2012
GEOPHYSICS | VOL. 77

MM-Cubing: computing Iceberg cubes by factorizing the lattice space
...
-
, et. al. ...
21 Jun 2004
21 Jun 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The case for a wide-table approach to manage sparse relational data sets

Abstract

Talk to us

Similar Papers