Abstract

This work proposes neural reference synthesis (NRS) to generate high-fidelity reference blocks for motion estimation and motion compensation (MEMC) in inter frame coding. The NRS comprises two submodules: one for reconstruction enhancement and the other for reference generation. Although numerous methods have been developed for these two submodules using either handcrafted rules or deep convolutional neural network (CNN) models, they generally treat the two separately, resulting in limited coding gains. By contrast, the NRS optimizes them collaboratively. It first develops two CNN-based models, EnhNet and GenNet: the EnhNet exploits only spatial correlations within the current frame for reconstruction enhancement, while the GenNet further aggregates temporal correlations across multiple frames for reference synthesis. However, directly concatenating the EnhNet and GenNet without accounting for the complex temporal reference dependency across inter frames implicitly induces iterative CNN processing and causes overfitting, leading to visually disturbing artifacts and oversmoothed pixels. To tackle this problem, the NRS applies a new training strategy that coordinates the EnhNet and GenNet for more robust and generalizable models, and also devises a lightweight multi-level R-D (rate-distortion) selection policy that lets the encoder adaptively choose reference blocks generated by the proposed NRS model or by the conventional coding process. Our NRS not only offers state-of-the-art coding gains, e.g., >10% BD-Rate (Bjøntegaard Delta Rate) reduction against the High Efficiency Video Coding (HEVC) anchor for a variety of common test video sequences encoded over a wide bitrate range in both low-delay and random access settings, but also greatly reduces complexity relative to existing learning-based methods by utilizing more lightweight DNNs. All models are made publicly accessible at https://github.com/IVC-Projects/NRS for reproducible research.
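
For intuition, the sketch below illustrates the two-stage flow (spatial enhancement followed by temporal reference synthesis) and the block-level R-D selection summarized above, written as a minimal PyTorch example. The layer counts, channel widths, residual structure, and the rd_cost / select_reference helpers are illustrative assumptions only; they are not the paper's actual EnhNet/GenNet architectures or the encoder's selection interface (see the released models at the repository above for those).

```python
import torch
import torch.nn as nn


class EnhNet(nn.Module):
    """Single-frame enhancement: exploits spatial correlations only (residual CNN sketch)."""

    def __init__(self, channels: int = 32):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(1, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, 1, 3, padding=1),
        )

    def forward(self, recon: torch.Tensor) -> torch.Tensor:
        # Predict a residual and add it back to the decoded reconstruction.
        return recon + self.body(recon)


class GenNet(nn.Module):
    """Reference synthesis: aggregates temporal correlations across several enhanced frames."""

    def __init__(self, num_refs: int = 2, channels: int = 32):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(num_refs, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, 1, 3, padding=1),
        )

    def forward(self, enhanced_refs: torch.Tensor) -> torch.Tensor:
        # enhanced_refs: (N, num_refs, H, W) stack of EnhNet outputs.
        return self.body(enhanced_refs)


def rd_cost(distortion: float, rate: float, lam: float) -> float:
    """Lagrangian rate-distortion cost J = D + lambda * R (hypothetical helper)."""
    return distortion + lam * rate


def select_reference(cost_nrs: float, cost_conventional: float) -> str:
    # Block-level decision: keep whichever reference yields the lower R-D cost;
    # a real encoder would also signal this choice in the bitstream.
    return "nrs" if cost_nrs < cost_conventional else "conventional"


if __name__ == "__main__":
    enh, gen = EnhNet(), GenNet(num_refs=2)
    recon_prev = torch.rand(1, 1, 64, 64)    # two decoded reference frames (toy sizes)
    recon_prev2 = torch.rand(1, 1, 64, 64)
    enhanced = torch.cat([enh(recon_prev), enh(recon_prev2)], dim=1)
    synthesized = gen(enhanced)              # synthesized reference frame for MEMC
    print(synthesized.shape)                 # torch.Size([1, 1, 64, 64])
```

The key point the sketch is meant to convey is the ordering: EnhNet operates per frame on the decoded reconstruction, GenNet consumes a stack of enhanced frames to synthesize the reference, and the encoder then falls back to the conventional reference whenever that gives a lower R-D cost.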
