Versatile Video Coding Research Articles

As a rapid development of neural-network-based machine learning algorithms, deep learning methods are being tentatively used in a much wider range than well-known artificial intelligence applications such as face recognition or auto-driving. Recently, deep learning models are investigated intensively to improve the compression efficiency for video coding, especially at the in-loop filtering stage. Although deep learning-based in-loop filtering methods in prior arts have already shown a remarkable potential capability in video coding, content propagation issue is still not well recognized and addressed yet. Content propagation is the fact that contents of reference frames are propagated to frames referring to them, which typically leads to over-filtering issues. In this article, we develop an iteratively trained deep in-loop filter with adaptive model selection (iDAM) to address the content propagation issue. First, we propose an iterative training scheme, which enables the network to gradually take into account the impacts of content propagation. Second, we propose a filter selection mechanism, i.e., allowing a block to select from a set of candidate filters with different filtering strengths. Besides, we propose a novel approach to design a conditional in-loop filtering method that can deal with multiple quality levels with a single model and serve the functionality of filter selection by modifying the input parameters. Extensive experiments on top of the latest video coding standard (Versatile Video Coding, VVC) have been conducted to evaluate the proposed techniques. Compared with VTM-11.0, our scheme achieves a new state-of-the-art, leading to {7.91%, 20.25%, 20.44%}, {11.64%, 26.40%, 26.50%}, and {10.97%, 26.63%, 26.77%} BD-rate reductions on average for {Y, Cb, Cr} under all-intra, random-access, and low-delay configurations, respectively. As far as we know, our proposed iDAM scheme provides the highest coding performance compared to all existing solutions. In addition, the syntax elements of the proposed scheme were adopted at the 76th meeting of Audio Video coding Standard (AVS) held this year.

Read full abstract

Video coding algorithms attempt to minimize the significant commonality that exists within a video sequence. Each new video coding standard contains tools that can perform this task more efficiently compared to its predecessors. Modern video coding systems are block-based wherein commonality modeling is carried out only from the perspective of the block that need be coded next. In this work, we argue for a commonality modeling approach that can provide a seamless blending between global and local homogeneity information in terms of motion. For this purpose, at first a prediction of the current frame, the frame that need be coded, is generated by performing a two-step discrete cosine basis oriented (DCO) motion modeling. The DCO motion model is employed rather than traditional translational or affine motion model since it has the ability to efficiently model complex motion fields by providing a smooth and sparse representation. Moreover, the proposed two-step motion modeling approach can yield better motion compensation at a reduced computational complexity since an informed guess is designed for initializing the motion search procedure. After that the current frame is partitioned into rectangular regions and the conformance of these regions to the learned motion model is investigated. Depending on the non-conformance to the estimated global motion model, an additional DCO motion model is introduced to increase the local motion homogeneity. In this way, the proposed approach generates a motion compensated prediction of the current frame through the minimization of both global and local motion commonality. Experimental results show an improved rate-distortion performance of a reference high efficiency video coding (HEVC) encoder, specifically up to around 9% savings in bit rate, that employs the DCO prediction frame as a reference frame for encoding the current frame. When compared to the more recent video coding standard, the versatile video coding (VVC) encoder, a bit rate savings of 2.37% is reported.

Read full abstract

Versatile Video Coding Research Articles

Related Topics

Articles published on Versatile Video Coding

Using Four Hypothesis Probability Estimators for CABAC in Versatile Video Coding

IDAM: Iteratively Trained Deep In-loop Filter with Adaptive Model Selection

Frequency-Based Adaptive Interpolation Filter in Intra Prediction

User perception for dynamic video resolution change using VVC

Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding

Decoder-Side Cross Resolution Synthesis for Video Compression Enhancement

Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation

High Quality Video Frames From VVC: A Deep Neural Network Approach

Joint Decision Tree and Visual Feature Rate Control Optimization for VVC UHD Coding.

Versatile Video Coding-Based Coding Tree Unit Level Image Compression With Dual Quantization Parameters for Hybrid Vision

Tree-Structured Data Clustering-Driven Neural Network for Intra Prediction in Video Coding.

Fast CU Decision Algorithm Based on Texture Complexity and CNN for VVC

A Two-Step Discrete Cosine Basis Oriented Motion Modeling Approach for Enhanced Motion Compensation.

A Super-Resolution-Based Feature Map Compression for Machine-Oriented Video Coding

Analysis of the Limitations of Further Improvement of the Efficiency of VVC-CABAC

SP-DSTS-MIMO Scheme-Aided H.266 for Reliable High Data Rate Mobile Video Communication

Reinforcement Learning for Rate-Distortion Optimized Hierarchical Prediction Structure

Learned Image Compression Using Cross-Component Attention Mechanism.

Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules.

Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Versatile Video Coding Research Articles

Related Topics

Articles published on Versatile Video Coding

Using Four Hypothesis Probability Estimators for CABAC in Versatile Video Coding

IDAM: Iteratively Trained Deep In-loop Filter with Adaptive Model Selection

Frequency-Based Adaptive Interpolation Filter in Intra Prediction

User perception for dynamic video resolution change using VVC

Partition Map Prediction for Fast Block Partitioning in VVC Intra-Frame Coding

Decoder-Side Cross Resolution Synthesis for Video Compression Enhancement

Efficient VVC Intra Prediction Based on Deep Feature Fusion and Probability Estimation

High Quality Video Frames From VVC: A Deep Neural Network Approach

Joint Decision Tree and Visual Feature Rate Control Optimization for VVC UHD Coding.

Versatile Video Coding-Based Coding Tree Unit Level Image Compression With Dual Quantization Parameters for Hybrid Vision

Tree-Structured Data Clustering-Driven Neural Network for Intra Prediction in Video Coding.

Fast CU Decision Algorithm Based on Texture Complexity and CNN for VVC

A Two-Step Discrete Cosine Basis Oriented Motion Modeling Approach for Enhanced Motion Compensation.

A Super-Resolution-Based Feature Map Compression for Machine-Oriented Video Coding

Analysis of the Limitations of Further Improvement of the Efficiency of VVC-CABAC

SP-DSTS-MIMO Scheme-Aided H.266 for Reliable High Data Rate Mobile Video Communication

Reinforcement Learning for Rate-Distortion Optimized Hierarchical Prediction Structure

Learned Image Compression Using Cross-Component Attention Mechanism.

Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules.

Lenslet Image Coding With SAIs Synthesis via 3D CNNs-Based Reinforcement Learning With a Rate Reward