Abstract

Scientific simulations on high-performance computing (HPC) systems generate vast amounts of floating-point data that must be reduced to lower storage and I/O costs. Lossy compressors trade data accuracy for reduction performance and have been demonstrated to be effective in reducing data volume. However, a key hurdle to wide adoption of lossy compressors is that the trade-off between data accuracy and compression performance, particularly the compression ratio, is not well understood. Consequently, domain scientists often must try many candidate error bounds before arriving at an appropriate setup. The current practice of using lossy compressors to reduce data volume is therefore trial and error, which is inefficient for large datasets that require substantial computational resources to compress. This paper aims to analyze and estimate the compression performance of lossy compressors on HPC datasets. In particular, we predict the compression ratios of two modern, high-performing lossy compressors, SZ and ZFP, on HPC scientific datasets at various error bounds, based on the compressors' intrinsic metrics collected under a given base error bound. We evaluate the estimation scheme using twenty real HPC datasets, and the results confirm the effectiveness of our approach.
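For context, the trial-and-error practice the abstract refers to amounts to sweeping candidate error bounds and measuring the resulting compression ratio directly, which is what the proposed estimation scheme avoids. A minimal sketch of such a sweep, assuming the zfpy Python bindings for ZFP are available (the `compress_numpy` call and `tolerance` keyword follow zfpy's fixed-accuracy mode, but treat the exact API and the synthetic data as assumptions, not part of the paper), might look like:

```python
import numpy as np
import zfpy  # Python bindings for ZFP; assumed to be installed

# Synthetic stand-in for an HPC field; a real evaluation would load
# simulation output (e.g., a 3D double-precision array) instead.
data = np.random.default_rng(0).standard_normal((64, 64, 64))

# Trial-and-error practice: compress under each absolute error bound
# and measure the achieved compression ratio directly.
for error_bound in (1e-1, 1e-2, 1e-3, 1e-4):
    compressed = zfpy.compress_numpy(data, tolerance=error_bound)
    ratio = data.nbytes / len(compressed)
    print(f"tolerance={error_bound:g}  compression ratio={ratio:.2f}")
```

Each pass over the data costs a full compression run, which is exactly the expense the paper's ratio-prediction approach seeks to sidestep by extrapolating from metrics gathered at a single base error bound.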
