Clustering and presorting for parallel burrows wheeler-based compression

Sergey Voronin,Eugene Borovikov,Raqibul Hasan

doi:10.1142/s1793962321500501

Clustering and presorting for parallel burrows wheeler-based compression

Sergey Voronin, Eugene Borovikov + Show 1 more

https://doi.org/10.1142/s1793962321500501

Copy DOI

Journal: International Journal of Modeling, Simulation, and Scientific Computing

Publication Date: Jul 5, 2021

Affiliation: Intelligent Automation (United States)

#Big Data Applications #Modern Day + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We describe practical improvements for parallel BWT-based lossless compressors frequently utilized in modern day big data applications. We propose a clustering-based data permutation approach for improving compression ratio for data with significant alphabet variation along with a faster string sorting approach based on the application of the [Formula: see text] complexity counting sort with permutation reindexing.

Full Text