Rate-distortion Performance Research Articles

The emerging field semantic communication is driving the research of end-to-end data transmission. By utilizing the powerful representation ability of deep learning models, learned data transmission schemes have exhibited superior performance than the established source and channel coding methods. While, so far, research efforts mainly concentrated on architecture and model improvements toward a static target domain. Despite their successes, such learned models are still suboptimal due to the limitations in model capacity and imperfect optimization and generalization, particularly when the testing data distribution or channel response is different from that adopted for model training, as is likely to be the case in real-world. To tackle this, in this paper, we propose a novel online learned joint source and channel coding approach that leverages the deep learning model’s overfitting property. Specifically, we update the off-the-shelf pre-trained models after deployment in a lightweight online fashion to adapt to the distribution shifts in source data and environment domain. We take the overfitting concept to the extreme, proposing a series of implementation-friendly methods to adapt the codec model or representations to an individual data or channel state instance, which can further lead to substantial gains in terms of the end-to-end rate-distortion performance. Accordingly, the streaming ingredients include both the semantic representations of source data and the online updated decoder model parameters. The system design is formulated as a joint optimization problem whose goal is to minimize the loss function, a tripartite trade-off among the data stream bandwidth cost, model stream bandwidth cost, and end-to-end distortion. The proposed methods enable the communication-efficient adaptation for all parameters in the network without sacrificing decoding speed. Extensive experiments, including user study, on continually changing target source data and wireless channel environments, demonstrate the effectiveness and efficiency of our approach, on which we outperform existing state-of-the-art engineered transmission scheme (VVC combined with 5G LDPC coded transmission).

Read full abstract

Distributed video coding (DVC) is based on distributed source coding (DSC) concepts in which video statistics are used partially or completely at the decoder rather than the encoder. The rate-distortion (RD) performance of distributed video codecs substantially lags the conventional predictive video coding. Several techniques and methods are employed in DVC to overcome this performance gap and achieve high coding efficiency while maintaining low encoder computational complexity. However, it is still challenging to achieve coding efficiency and limit the computational complexity of the encoding and decoding process. The deployment of distributed residual video coding (DRVC) improves coding efficiency, but significant enhancements are still required to reduce these gaps. This paper proposes the QUAntized Transform ResIdual Decision (QUATRID) scheme that improves the coding efficiency by deploying the Quantized Transform Decision Mode (QUAM) at the encoder. The proposed QUATRID scheme's main contribution is a design and integration of a novel QUAM method into DRVC that effectively skips the zero quantized transform (QT) blocks, thus limiting the number of input bit planes to be channel encoded and consequently reducing both the channel encoding and decoding computational complexity. Moreover, an online correlation noise model (CNM) is specifically designed for the QUATRID scheme and implemented at its decoder. This online CNM improves the channel decoding process and contributes to the bit rate reduction. Finally, a methodology for the reconstruction of the residual frame (R^) is developed that utilizes the decision mode information passed by the encoder, decoded quantized bin, and transformed estimated residual frame. The Bjøntegaard delta analysis of experimental results shows that the QUATRID achieves better performance over the DISCOVER by attaining the PSNR between 0.06 dB and 0.32 dB and coding efficiency, which varies from 5.4 to 10.48 percent. In addition to this, results determine that for all types of motion videos, the proposed QUATRID scheme outperforms the DISCOVER in terms of reducing the number of input bit-planes to be channel encoded and the entire encoder's computational complexity. The number of bit plane reduction exceeds 97%, while the entire Wyner-Ziv encoder and channel coding computational complexity reduce more than nine-fold and 34-fold, respectively.

Read full abstract

Rate-distortion Performance Research Articles

Articles published on Rate-distortion Performance

Corner-to-Center long-range context model for efficient learned image compression

Medical Image Compression Using Block-to-Row Principal Component Analysis (BTRPCA)

Belief Propagation Optimization for Lossy Compression Based on Gaussian Source

A new paradigm for high-capacity reversible data hiding with pixel repetition and adaptive embedding

Toward Adaptive Semantic Communications: Efficient Data Transmission via Online Learned Nonlinear Transform Source-Channel Coding

Block-Adaptive Point Cloud Attribute Coding With Region-Aware Optimized Transform

Asymmetric Learned Image Compression With Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering

Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer

Rate distortion optimization with adaptive content modeling for random-access versatile video coding

Variable Rate Point Cloud Geometry Compression Method

Learned Progressive Image Compression With Dead-Zone Quantizers

Multiple hypotheses based motion compensation for learned video compression

Impact of Image Compression on In Vitro Cell Migration Analysis

Nonlinear Transforms in Learned Image Compression From a Communication Perspective

Advanced quantum image representation and compression using a DCT-EFRQI approach

Remote sensing image compression with long-range convolution and improved non-local attention model

Learning Context-Based Nonlocal Entropy Modeling for Image Compression

Improving distributed video coding with deep learning

Lossy P-LDPC Codes for Compressing General Sources Using Neural Networks

Low Computational Coding-Efficient Distributed Video Coding: Adding a Decision Mode to Limit Channel Coding Load

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Rate-distortion Performance Research Articles

Articles published on Rate-distortion Performance

Corner-to-Center long-range context model for efficient learned image compression

Medical Image Compression Using Block-to-Row Principal Component Analysis (BTRPCA)

Belief Propagation Optimization for Lossy Compression Based on Gaussian Source

A new paradigm for high-capacity reversible data hiding with pixel repetition and adaptive embedding

Toward Adaptive Semantic Communications: Efficient Data Transmission via Online Learned Nonlinear Transform Source-Channel Coding

Block-Adaptive Point Cloud Attribute Coding With Region-Aware Optimized Transform

Asymmetric Learned Image Compression With Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering

Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer

Rate distortion optimization with adaptive content modeling for random-access versatile video coding

Variable Rate Point Cloud Geometry Compression Method

Learned Progressive Image Compression With Dead-Zone Quantizers

Multiple hypotheses based motion compensation for learned video compression

Impact of Image Compression on In Vitro Cell Migration Analysis

Nonlinear Transforms in Learned Image Compression From a Communication Perspective

Advanced quantum image representation and compression using a DCT-EFRQI approach

Remote sensing image compression with long-range convolution and improved non-local attention model

Learning Context-Based Nonlocal Entropy Modeling for Image Compression

Improving distributed video coding with deep learning

Lossy P-LDPC Codes for Compressing General Sources Using Neural Networks

Low Computational Coding-Efficient Distributed Video Coding: Adding a Decision Mode to Limit Channel Coding Load