Abstract

Frame extrapolation predicts future frames from past (reference) frames; it has been studied intensively in computer vision research and has great potential in video coding. Recently, a number of studies have applied deep networks to frame extrapolation, with some success. However, due to the complex and diverse motion patterns in natural video, it is still difficult to extrapolate high-fidelity frames directly from reference frames. To address this problem, we introduce reference frame alignment as a key technique for deep network-based frame extrapolation. We propose to align the reference frames, e.g. using block-based motion estimation and motion compensation, and then extrapolate from the aligned frames with a trained deep network. Since the alignment, performed as a preprocessing step, effectively reduces the diversity of the network input, we observe that the network is easier to train and the extrapolated frames are of higher quality. We verify the proposed technique in video coding, using the extrapolated frame for inter prediction in High Efficiency Video Coding (HEVC) and Versatile Video Coding (VVC). We investigate different schemes, including whether to align the target frame with the reference frames, and whether to perform motion estimation on the extrapolated frame. We conduct a comprehensive set of experiments to study the efficiency of the proposed method and to compare the different schemes. Experimental results show that our proposal achieves on average 5.3% and 2.8% BD-rate reduction in the Y component compared to HEVC, under the low-delay P and low-delay B configurations, respectively. Our proposal also performs much better than frame extrapolation without reference frame alignment.
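The alignment step described above relies on block-based motion estimation and motion compensation. As a rough illustration only (the function names, block size, and search range below are our own choices, not details from the paper), a minimal full-search block-matching sketch in Python could look like:

```python
import numpy as np

def block_motion_estimate(ref, tgt, block=8, search=4):
    """Full-search block matching: for each block of tgt, find the
    best-matching block in ref within +/-search pixels (SAD metric).
    Returns a motion-vector field of shape (H//block, W//block, 2)."""
    H, W = tgt.shape
    mvs = np.zeros((H // block, W // block, 2), dtype=int)
    for by in range(H // block):
        for bx in range(W // block):
            y0, x0 = by * block, bx * block
            cur = tgt[y0:y0 + block, x0:x0 + block].astype(int)
            best, best_mv = None, (0, 0)
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    y, x = y0 + dy, x0 + dx
                    if y < 0 or x < 0 or y + block > H or x + block > W:
                        continue  # candidate block falls outside the frame
                    cand = ref[y:y + block, x:x + block].astype(int)
                    sad = np.abs(cur - cand).sum()
                    if best is None or sad < best:
                        best, best_mv = sad, (dy, dx)
            mvs[by, bx] = best_mv
    return mvs

def motion_compensate(ref, mvs, block=8):
    """Warp ref toward the target frame using the block motion field,
    producing an aligned reference frame."""
    out = np.zeros_like(ref)
    for by in range(mvs.shape[0]):
        for bx in range(mvs.shape[1]):
            dy, dx = mvs[by, bx]
            y0, x0 = by * block, bx * block
            out[y0:y0 + block, x0:x0 + block] = \
                ref[y0 + dy:y0 + dy + block, x0 + dx:x0 + dx + block]
    return out
```

In the proposed pipeline, each reference frame would first be compensated in this way so that the deep extrapolation network sees inputs with much of the inter-frame motion already removed; production codecs such as HEVC/VVC use far more elaborate motion models (sub-pel interpolation, variable block sizes) than this sketch.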
