Towards Decoupling the Selection of Compression Algorithms from Quality Constraints – An Investigation of Lossy Compression Efficiency

Julian Kunkel, Anastasiia Novikova, Eugen Betke

doi:10.14529/jsfi170402

Abstract

Data-intensive scientific domains use data compression to reduce the storage space needed. Lossless data compression preserves information exactly, but lossy data compression can achieve much higher compression rates, depending on the tolerable error margins. There are many ways to define precision and to exploit this knowledge; therefore, the field of lossy compression is subject to active research. From the perspective of a scientist, only the qualitative definition of the implied loss of data precision should matter. With the Scientific Compression Library (SCIL), we are developing a meta-compressor that allows users to define various quantities for acceptable error and expected performance behavior. The library then picks a suitable chain of algorithms that satisfies the user’s requirements; the ongoing work is a preliminary stage for the design of an adaptive selector. This approach is a crucial step towards a scientifically safe use of much-needed lossy data compression, because it disentangles the task of determining the scientific characteristics of tolerable noise from the task of determining an optimal compression strategy. Future algorithms can be used without changing application code. In this paper, we evaluate various lossy compression algorithms for compressing different scientific datasets (Isabel, ECHAM6), and focus on the analysis of synthetically created data that serves as a blueprint for many observed datasets. We also briefly describe the available quantities of SCIL to define data precision and introduce two efficient compression algorithms for individual data points. The results show that the best algorithm depends on user settings and data properties.
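To make the decoupling idea concrete, the following is a minimal sketch in Python. It is not SCIL's API or any of the algorithms from the paper: every function name here is hypothetical, and zlib plus uniform quantization are stand-ins chosen only for illustration. The caller states an absolute error tolerance, and a toy selector picks whichever candidate chain yields the smallest output while respecting that tolerance.

```python
import math
import struct
import zlib


def compress_lossless(values):
    """Candidate 1: pack the doubles unchanged and deflate them."""
    raw = struct.pack(f"<{len(values)}d", *values)
    return zlib.compress(raw)


def compress_quantized(values, abs_tol):
    """Candidate 2: snap values to bins of width 2*abs_tol, then deflate.

    Rounding to the nearest bin keeps the error within abs_tol
    (up to floating-point rounding).
    """
    step = 2.0 * abs_tol
    bins = [round(v / step) for v in values]
    raw = struct.pack(f"<{len(bins)}q", *bins)
    return zlib.compress(raw)


def decompress_quantized(blob, abs_tol):
    """Inverse of compress_quantized: bins back to approximate values."""
    raw = zlib.decompress(blob)
    bins = struct.unpack(f"<{len(raw) // 8}q", raw)
    step = 2.0 * abs_tol
    return [b * step for b in bins]


def select_chain(values, abs_tol):
    """Toy selector (hypothetical): smallest output among the valid chains."""
    candidates = {"lossless-zlib": compress_lossless(values)}
    if abs_tol > 0:
        # The lossy chain is only admissible if some error is tolerated.
        candidates["quantize+zlib"] = compress_quantized(values, abs_tol)
    name = min(candidates, key=lambda k: len(candidates[k]))
    return name, candidates[name]


# Synthetic input: one smooth sine wave sampled at 1000 points.
data = [math.sin(2.0 * math.pi * i / 1000.0) for i in range(1000)]
chain, blob = select_chain(data, abs_tol=0.01)
```

The application code only states the tolerance; a new candidate chain could be added to `select_chain` without touching the caller, which is the kind of separation the abstract describes.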


Journal: Supercomputing Frontiers and Innovations
Publication Date: Dec 1, 2017
Citations: 4
License type: cc-by


Similar Papers

Toward Decoupling the Selection of Compression Algorithms from Quality Constraints
Julian Kunkel ... Anastasiia Novikova
01 Jan 2017

Temporal Lossless and Lossy Compression in Wireless Sensor Networks
Yimei Li ... Yao Liang
ACM Transactions on Sensor Networks | VOL. 12
25 Oct 2016

Performance evaluation of lossy quality compression algorithms for RNA-seq data
Rongshan Yu ... Wenxian Yang
BMC Bioinformatics | VOL. 21
20 Jul 2020

Efficient temporal compression in wireless sensor networks
Yao Liang
01 Oct 2011
