Abstract

Data compression can alleviate the I/O pressure of data-intensive scientific applications. Unfortunately, naively applying compression at the file or block level forces a dilemma between efficient random access and high compression ratios. File-level compression barely supports efficient random access to the compressed data: any retrieval request must trigger decompression from the beginning of the compressed file. Block-level compression provides flexible random access to the compressed blocks, but applying the compressor to each and every block introduces per-block overhead that degrades the overall compression ratio. This paper extends our prior work on virtual chunks, which offer efficient random access to compressed scientific data without sacrificing the compression ratio. Virtual chunks are logical blocks pointed to by appended references that do not break the physical continuity of the file content. These references allow decompression to start from an arbitrary position (efficient random access), while no per-block overhead is introduced because the file's physical entirety is retained (high compression ratio). One limitation of virtual chunks is that they support only static references. This paper presents the algorithms, analysis, and evaluation of dynamic virtual chunks, which handle cases where the references are updated dynamically.
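
For illustration only, the following sketch (not taken from the paper; the chunk size, trailer layout, and use of zlib full-flush points are assumptions) shows how references appended to the end of a single continuous compressed stream can let decompression start at an arbitrary virtual-chunk boundary, without per-block headers or separate compressed blocks.

```python
import struct
import zlib

CHUNK = 1 << 20  # hypothetical virtual-chunk size: 1 MiB of uncompressed data


def compress_with_references(data: bytes) -> bytes:
    """Compress `data` as one continuous raw-deflate stream, placing a full
    flush at every virtual-chunk boundary and appending the boundary byte
    offsets (the references) as a trailer at the end of the file."""
    comp = zlib.compressobj(level=6, wbits=-15)  # raw deflate: one physical stream
    out, refs, pos = [], [], 0
    for i in range(0, len(data), CHUNK):
        refs.append(pos)  # reference: offset where this virtual chunk starts
        piece = comp.compress(data[i:i + CHUNK]) + comp.flush(zlib.Z_FULL_FLUSH)
        out.append(piece)
        pos += len(piece)
    out.append(comp.flush())  # terminate the deflate stream
    # trailer: one 8-byte offset per reference, then the reference count
    trailer = b"".join(struct.pack("<Q", r) for r in refs) + struct.pack("<I", len(refs))
    return b"".join(out) + trailer


def read_chunk(blob: bytes, index: int) -> bytes:
    """Decompress a single virtual chunk by starting at its reference,
    instead of decompressing from the beginning of the file."""
    (nrefs,) = struct.unpack("<I", blob[-4:])
    table = blob[-4 - 8 * nrefs:-4]
    refs = [struct.unpack("<Q", table[i:i + 8])[0] for i in range(0, len(table), 8)]
    start = refs[index]
    dec = zlib.decompressobj(wbits=-15)
    return dec.decompress(blob[start:len(blob) - 4 - 8 * nrefs], CHUNK)
```

Because each reference marks a full-flush point, the decompressor can be seeded at that offset with no dictionary state from earlier chunks, while the compressed payload itself remains one unbroken stream; updating the references only requires rewriting the small appended trailer.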
