Compression Scheme Research Articles

We present the GPU implementation efforts and challenges of the sparse solver package STRUMPACK. The code is made publicly available on github with a permissive BSD license. STRUMPACK implements an approximate multifrontal solver, a sparse LU factorization which makes use of compression methods to accelerate time to solution and reduce memory usage. Multiple compression schemes based on rank-structured and hierarchical matrix approximations are supported, including hierarchically semi-separable, hierarchically off-diagonal butterfly, and block low rank. In this paper, we present the GPU implementation of the block low rank (BLR) compression method within a multifrontal solver. Our GPU implementation relies on highly optimized vendor libraries such as cuBLAS and cuSOLVER for NVIDIA GPUs, rocBLAS and rocSOLVER for AMD GPUs and the Intel oneAPI Math Kernel Library (oneMKL) for Intel GPUs. Additionally, we rely on external open source libraries such as SLATE (Software for Linear Algebra Targeting Exascale), MAGMA (Matrix Algebra on GPU and Multi-core Architectures), and KBLAS (KAUST BLAS). SLATE is used as a GPU-capable ScaLAPACK replacement. From MAGMA we use variable sized batched dense linear algebra operations such as GEMM, TRSM and LU with partial pivoting. KBLAS provides efficient (batched) low rank matrix compression for NVIDIA GPUs using an adaptive randomized sampling scheme. The resulting sparse solver and preconditioner runs on NVIDIA, AMD and Intel GPUs. Interfaces are available from PETSc, Trilinos and MFEM, or the solver can be used directly in user code. We report results for a range of benchmark applications, using the Perlmutter system from NERSC, Frontier from ORNL, and Aurora from ALCF. For a high frequency wave equation on a regular mesh, using 32 Perlmutter compute nodes, the factorization phase of the exact GPU solver is about 6.5× faster compared to the CPU-only solver. The BLR-enabled GPU solver is about 13.8× faster than the CPU exact solver. For a collection of SuiteSparse matrices, the STRUMPACK exact factorization on a single GPU is on average 1.9× faster than NVIDIA’s cuDSS solver.

Read full abstract

ObjectivesElectrocardiogram (ECG) signals are beneficial for diagnosing cardiac diseases. The cardiac patients' life quality likely increases with continuous or long-period recording and monitoring of ECG signals, leading to better and early diagnosis of disease and heart attacks. However, continuous ECG recording necessitates high data rates and storage, which means high costs. Therefore, ECG compression is a handy concept that facilitates continuous monitoring of ECG signals. Deep neural networks open up new horizons for compression and also for ECG compression by providing high compression rates and quality. Although they bring constant compression ratios with better average quality, the compression quality of individual samples is not guaranteed, which may lead to misdiagnoses. This study aims to investigate the effect of compression quality on the diagnoses and to develop a deep neural network-based compression strategy that guarantees a quality-bound in return for varying compression ratios.Materials and methodsThe effect of the compression quality on the arrhythmia diagnoses is tested by comparing the performance of the deep learning-based ECG classifier on the original ECG recordings and the distorted recordings using a lossy compression algorithm with different compression error levels. Then, a compression error upper limit is calculated in terms of normalized percent root mean square difference (PRDN) error, which also coincides with the findings of the previous studies in the literature. Lastly, to enable deep learning in ECG compression, a single encoder-multi-decoder convolutional autoencoder architecture, and multiple quantization levels are proposed to guarantee a desired upper limit on the error rate.ResultsThe efficiency of the proposed method is demonstrated on a popular benchmark data set for ECG compression methods using a transfer learning approach. The PRDN error is fixed to various values, and the average compression rates are reported. An average of 13.019:1 compression is achieved for a 10% PRDN error rate, assessed as a fair quality threshold for reconstruction error. It has also been shown that the compression model has a runtime that can be run in real-time on wearable devices such as commercial smartwatches.ConclusionThis study proposes a deep learning-based ECG compression algorithm that guarantees a desired upper limit on the compression error. This model may facilitate an eHealth solution for continuous monitoring of ECG signals of individuals, especially cardiac patients.

Read full abstract

Compression Scheme Research Articles

Related Topics

Articles published on Compression Scheme

RGB-D Data Compression via Bi-Directional Cross-Modal Prior Transfer and Enhanced Entropy Modeling

Reduction of arrival time jitter or energy spread with arclike variable bunch compressors

Image reconstruction from compressed measurements for ultrasound NDT

A Knowledge Base Driven Task-Oriented Image Semantic Communication Scheme

Lossless Recompression of Vector Quantization Index Table for Texture Images Based on Adaptive Huffman Coding Through Multi-Type Processing

Coherent high-harmonic generation with laser-plasma beams

Accelerating and Compressing Transformer-Based PLMs for Enhanced Comprehension of Computer Terminology

Correction to: A remote sensing image encryption and compression scheme using novel hyperchaotic system and plaintext related random S-box

RFCSC: Communication efficient reinforcement federated learning with dynamic client selection and adaptive gradient compression

Quantum data compression under localized features

Adaptive compressed learning boosts both efficiency and utility of differentially private federated learning

Improving Cloud Database Performance Using Storage Technologies

Modular-Based Compression Scheme for Address Data in the Blockchain System for IoV Applications

A graphics processing unit accelerated sparse direct solver and preconditioner with block low rank compression

A remote sensing image encryption and compression scheme using novel hyperchaotic system and plaintext related random S-box

Assembly Theory is an approximation to algorithmic complexity based on LZ compression that does not explain selection or evolution

Efficient genetic code expansion without host genome modifications.

Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments.

Electrocardiogram Signal Compression Using Deep Convolutional Autoencoder with Constant Error and Flexible Compression Rate

Compression and Confusion Scheme with Perceptual Security for Smart Cities Applications

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Compression Scheme Research Articles

Related Topics

Articles published on Compression Scheme

RGB-D Data Compression via Bi-Directional Cross-Modal Prior Transfer and Enhanced Entropy Modeling

Reduction of arrival time jitter or energy spread with arclike variable bunch compressors

Image reconstruction from compressed measurements for ultrasound NDT

A Knowledge Base Driven Task-Oriented Image Semantic Communication Scheme

Lossless Recompression of Vector Quantization Index Table for Texture Images Based on Adaptive Huffman Coding Through Multi-Type Processing

Coherent high-harmonic generation with laser-plasma beams

Accelerating and Compressing Transformer-Based PLMs for Enhanced Comprehension of Computer Terminology

Correction to: A remote sensing image encryption and compression scheme using novel hyperchaotic system and plaintext related random S-box

RFCSC: Communication efficient reinforcement federated learning with dynamic client selection and adaptive gradient compression

Quantum data compression under localized features

Adaptive compressed learning boosts both efficiency and utility of differentially private federated learning

Improving Cloud Database Performance Using Storage Technologies

Modular-Based Compression Scheme for Address Data in the Blockchain System for IoV Applications

A graphics processing unit accelerated sparse direct solver and preconditioner with block low rank compression

A remote sensing image encryption and compression scheme using novel hyperchaotic system and plaintext related random S-box

Assembly Theory is an approximation to algorithmic complexity based on LZ compression that does not explain selection or evolution

Efficient genetic code expansion without host genome modifications.

Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments.

Electrocardiogram Signal Compression Using Deep Convolutional Autoencoder with Constant Error and Flexible Compression Rate

Compression and Confusion Scheme with Perceptual Security for Smart Cities Applications