Effect of d-Dimensional Re-orderings on Lossless Compression of Radio-Astronomy and Digital Elevation Data

Conrad J Haupt, Ekow J Otoo, Ling Cheng

doi:10.1109/access.2021.3084838

Open Access

https://doi.org/10.1109/access.2021.3084838

Abstract

For multidimensional data, Space-Filling Curves (SFCs) have been used to improve the execution time of spatial data queries. However, their effect on compression, when used to reorder the uncompressed values, is less well understood. We investigate the impact of three SFCs on Shuttle Radar Topography Mission (SRTM) elevation data and Square-Kilometre Array telescope (SKA) radio-astronomy data: two types of datasets to which SFCs have not been extensively applied in a compression context. This work contributes to the understanding of how such reorderings impact compression performance and how they interact with different compression schemes and preprocessing techniques. We show empirical results from combining eight common compression schemes; the Z-Order, Gray-Code, and Hilbert space-filling curves; and the bitwise preprocessing technique BitShuffle. The Hilbert Curve consistently outperforms the other orderings for the SRTM dataset, though the mapping implementation incurs a significant speed penalty. However, the Z-Order and Gray-Code Curves are best for the SKA dataset. Through an analysis of the dataset autocorrelations, file-entropies, and block-entropies, we show that the SKA dataset's dimensional bias is exploited less by the Hilbert Curve than by the Z-Order and Gray-Code Curves. The Hilbert Curve is nevertheless the most appropriate for the SRTM dataset, as that dataset can be modelled as isotropic and has a significantly higher level of local autocorrelation. BitShuffle is necessary to compress the SKA data in practice, while for the SRTM dataset it contributes to compression performance. These curves and BitShuffle are advantageous in reducing block-entropy values for such datasets.

Highlights

Space-Filling Curves are mappings between the one-dimensional space and the d-dimensional space, and are used to improve query times of spatial data-structures by reordering and indexing the underlying values, preserving some spatial locality. Their d-dimensional orderings create curves which wrap around themselves and traverse local subregions, clustering nearby points together. As a result, some neighbouring d-dimensional values are closer in the one-dimensional space than they would be under a standard row-major or raster scan; the extent to which is dependent on the type
As Space-Filling Curves (SFCs) map between d and one dimensions, they are applicable for Machine Learning (ML) scenarios where data must be mapped into an alternative form appropriate for a given ML model, in some
In this paper we focus on three SFCs and Row-Major Order, which we treat as a reference SFC called the Raster Curve or Raster Scan.

Summary

Introduction

Space-Filling Curves are mappings between the one-dimensional space and the d-dimensional space, and are used to improve query times of spatial data-structures by reordering and indexing the underlying values, preserving some spatial locality. Their d-dimensional orderings create curves which wrap around themselves and traverse local subregions, clustering nearby points together. Lebesgue discovered the Z-Order curve by interleaving the bits of d integer coordinate values, resulting in an observable zigzag pattern [20]. It was popularized by Morton in his work applying it to geodetic databases [21] and has been referred to as the Lebesgue Curve and the Morton Curve. Though different curves achieve the best metric in each paper, only the Z-Order, Gray-Code, and Hilbert Curves are covered in this paper to limit the scope.
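
The bit interleaving behind the Z-Order curve can be illustrated with a minimal sketch. It is not the paper's implementation: the function name `z_order_index`, the `bits` parameter, and the choice to place the first coordinate in the least significant position are all assumptions made for illustration.

```python
def z_order_index(coords, bits):
    """Interleave the low `bits` bits of each coordinate into one Z-Order index.

    Hypothetical helper: `coords` is a sequence of d non-negative integers.
    Assumed convention: coords[0] supplies the least significant bit of each
    interleaved group.
    """
    d = len(coords)
    index = 0
    for bit in range(bits):
        for dim, value in enumerate(coords):
            # Bit `bit` of dimension `dim` lands at position bit * d + dim.
            index |= ((value >> bit) & 1) << (bit * d + dim)
    return index


# Order the cells of a 4 x 4 grid along the Z-Order curve instead of row by row.
cells = sorted(
    ((x, y) for y in range(4) for x in range(4)),
    key=lambda cell: z_order_index(cell, bits=2),
)
print(cells[:4])  # the first 2 x 2 quadrant is visited before any other cell
```

Sorting a dataset's cells by such an index is one way to produce the reordered one-dimensional stream that is then handed to a compressor. A raster scan would instead finish a full row before moving to the next.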

Journal: IEEE Access
Publication Date: Jan 1, 2021
License type: CC BY 4.0
