Prediction In Video Coding Research Articles

Distributed video coding (DVC) is based on distributed source coding (DSC) concepts in which video statistics are used partially or completely at the decoder rather than the encoder. The rate-distortion (RD) performance of distributed video codecs substantially lags the conventional predictive video coding. Several techniques and methods are employed in DVC to overcome this performance gap and achieve high coding efficiency while maintaining low encoder computational complexity. However, it is still challenging to achieve coding efficiency and limit the computational complexity of the encoding and decoding process. The deployment of distributed residual video coding (DRVC) improves coding efficiency, but significant enhancements are still required to reduce these gaps. This paper proposes the QUAntized Transform ResIdual Decision (QUATRID) scheme that improves the coding efficiency by deploying the Quantized Transform Decision Mode (QUAM) at the encoder. The proposed QUATRID scheme's main contribution is a design and integration of a novel QUAM method into DRVC that effectively skips the zero quantized transform (QT) blocks, thus limiting the number of input bit planes to be channel encoded and consequently reducing both the channel encoding and decoding computational complexity. Moreover, an online correlation noise model (CNM) is specifically designed for the QUATRID scheme and implemented at its decoder. This online CNM improves the channel decoding process and contributes to the bit rate reduction. Finally, a methodology for the reconstruction of the residual frame (R^) is developed that utilizes the decision mode information passed by the encoder, decoded quantized bin, and transformed estimated residual frame. The Bjøntegaard delta analysis of experimental results shows that the QUATRID achieves better performance over the DISCOVER by attaining the PSNR between 0.06 dB and 0.32 dB and coding efficiency, which varies from 5.4 to 10.48 percent. In addition to this, results determine that for all types of motion videos, the proposed QUATRID scheme outperforms the DISCOVER in terms of reducing the number of input bit-planes to be channel encoded and the entire encoder's computational complexity. The number of bit plane reduction exceeds 97%, while the entire Wyner-Ziv encoder and channel coding computational complexity reduce more than nine-fold and 34-fold, respectively.

Read full abstract

Causal video coding is a coding paradigm where video source frames X 1 , X 2 ,..., X N are encoded in a frame-by-frame manner, the encoder for each frame can use all previous source frames and all previous encoded frames, and the corresponding decoder can use only all previous encoded frames. In the special case where the encoder for each frame X k is further restricted to enlist help only from all previous encoded frames, causal video coding is reduced to predictive video coding, which all MPEG-series and H-series video coding standards proposed so far are based upon. In this paper, we compare the rate distortion performance of causal video coding with that of predictive video coding from an information theoretic perspective by modeling each frame X k itself as a source X k ={X k (i)} i=1 ∞ . Let R c *(D 1, ...,D N ) (R p *(D1,...,DN), respectively) denote the minimum total rate required to achieve a given distortion level D 1 ,...,D N in causal video coding (predictive video coding, respectively). We first show that like R c *(D1,..., D N ), for jointly stationary and totally ergodic sources X 1 , X 2 ,..., XN, R p *(D 1 ,...,D N ) is equal to the infimum of the nth order total rate distortion function R p,n (D1,...,DN) over all n, where R p,n (D 1 ,...,D N ) itself is given by the minimum of an information quantity over a set of auxiliary random variables. We then prove that if the jointly stationary and totally ergodic sources X 1 ,..., X N form a (first-order) Markov chain, we have R p *(D 1 ,...,D N )=R c *(D 1 ,...,D N ). However, this is not true in general if X 1 ,..., X N do not form a (first-order) Markov chain. Specifically, we demonstrate that for independent and identically distributed vector source (X 1 ,..., X N ), if X 1 ,..., X N do not form a (first-order) Markov chain, then under some conditions on source frames and distortion, R c *(D 1 ,..., D N ) is strictly less than R p *(D 1 ,..., D N ) in general. Our techniques allow us to compare R p *(D 1 ,..., D N ) with R c *(D 1 ,..., D N ) even when the single-letter characterization of R p *(D 1 ,..., D N ), if any, is unknown.

Read full abstract

Prediction In Video Coding Research Articles

Related Topics

Articles published on Prediction In Video Coding

Spatio-Temporal Convolutional Neural Network for Enhanced Inter Prediction in Video Coding.

Chroma Intra Prediction With Lightweight Attention-Based Neural Networks

Low Computational Coding-Efficient Distributed Video Coding: Adding a Decision Mode to Limit Channel Coding Load

Tree-Structured Data Clustering-Driven Neural Network for Intra Prediction in Video Coding.

Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

Improved CNN-Based Learning of Interpolation Filters for Low-Complexity Inter Prediction in Video Coding

A simple encoder scheme for distributed residual video coding

Analysis of Affine Motion-Compensated Prediction in Video Coding

Distortion propagation modeling and its applications on frame level quantization control for predictive video coding

Transform Competition for Temporal Prediction in Video Coding

Intra-Prediction Mode Propagation for Video Coding

Reference Clip for Inter Prediction in Video Coding

Hybrid Wyner-Ziv Video Coding with No Feedback Channel

Rate Allocation in Predictive Video Coding Using a Convex Optimization Framework.

Multi-directional Mode Reduction Algorithm for Intra Prediction in Video Coding

Distributed video coding: Assessing the HEVC upgrade

Recursive Prediction for Joint Spatial and Temporal Prediction in Video Coding

On the Information Theoretic Performance Comparison of Causal Video Coding and Predictive Video Coding

Content-based irregularly shaped macroblock partition for inter frame prediction in video coding

Lossless Image Compression Using Super-Spatial Structure Prediction

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Prediction In Video Coding Research Articles

Related Topics

Articles published on Prediction In Video Coding

Spatio-Temporal Convolutional Neural Network for Enhanced Inter Prediction in Video Coding.

Chroma Intra Prediction With Lightweight Attention-Based Neural Networks

Low Computational Coding-Efficient Distributed Video Coding: Adding a Decision Mode to Limit Channel Coding Load

Tree-Structured Data Clustering-Driven Neural Network for Intra Prediction in Video Coding.

Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

Improved CNN-Based Learning of Interpolation Filters for Low-Complexity Inter Prediction in Video Coding

A simple encoder scheme for distributed residual video coding

Analysis of Affine Motion-Compensated Prediction in Video Coding

Distortion propagation modeling and its applications on frame level quantization control for predictive video coding

Transform Competition for Temporal Prediction in Video Coding

Intra-Prediction Mode Propagation for Video Coding

Reference Clip for Inter Prediction in Video Coding

Hybrid Wyner-Ziv Video Coding with No Feedback Channel

Rate Allocation in Predictive Video Coding Using a Convex Optimization Framework.

Multi-directional Mode Reduction Algorithm for Intra Prediction in Video Coding

Distributed video coding: Assessing the HEVC upgrade

Recursive Prediction for Joint Spatial and Temporal Prediction in Video Coding

On the Information Theoretic Performance Comparison of Causal Video Coding and Predictive Video Coding

Content-based irregularly shaped macroblock partition for inter frame prediction in video coding

Lossless Image Compression Using Super-Spatial Structure Prediction