Abstract
We present a simple but effective video interpolation framework that can be applied to various types of videos, including conventional videos and 360° videos. Our main idea is to predict the latent feature of an intermediate frame through latent feature encoders placed between the encoder and decoder networks, without explicitly computing optical flow or depth maps. The latent feature encoders take the latent features of the input images and predict the latent feature of a target image, i.e., an intermediate frame. Afterward, the decoder network reconstructs the target image from the latent feature. The proposed framework consists of fully convolutional networks, and it is therefore end-to-end trainable from scratch without requiring any information other than consecutive frames. We experimentally verify the superiority of the proposed method by comparing it to state-of-the-art methods on various types of datasets. Moreover, an ablation study is carried out to analyze the key components of the proposed method. Because the proposed method performs interpolation in the latent domain, it can be applied to various video interpolation tasks (e.g., NIR and depth videos) without restricting the type of input data.
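To make the encoder / latent-feature-encoder / decoder pipeline concrete, below is a minimal PyTorch sketch of the described data flow: two input frames are encoded into latent features, a latent feature encoder fuses them to predict the intermediate frame's latent, and a decoder reconstructs the frame. All module names, channel counts, and layer depths here are illustrative assumptions, not the authors' exact architecture.

```python
# Hypothetical sketch of the latent-domain interpolation pipeline.
# Layer choices and dimensions are assumptions for illustration only.
import torch
import torch.nn as nn

class Encoder(nn.Module):
    """Maps an input frame to a latent feature map (fully convolutional)."""
    def __init__(self, in_ch=3, feat_ch=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, stride=2, padding=1), nn.ReLU(inplace=True),
        )
    def forward(self, x):
        return self.net(x)

class LatentFeatureEncoder(nn.Module):
    """Predicts the intermediate frame's latent feature from the latents of
    the two input frames, with no optical flow or depth computation."""
    def __init__(self, feat_ch=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(2 * feat_ch, feat_ch, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(feat_ch, feat_ch, 3, padding=1),
        )
    def forward(self, z0, z1):
        return self.net(torch.cat([z0, z1], dim=1))

class Decoder(nn.Module):
    """Reconstructs the target (intermediate) frame from its latent feature."""
    def __init__(self, feat_ch=64, out_ch=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(feat_ch, feat_ch, 4, stride=2, padding=1), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(feat_ch, out_ch, 4, stride=2, padding=1),
        )
    def forward(self, z):
        return self.net(z)

# End-to-end pass: interpolate frame_t from consecutive frames frame_0, frame_1.
encoder, latent_enc, decoder = Encoder(), LatentFeatureEncoder(), Decoder()
frame_0 = torch.randn(1, 3, 128, 128)
frame_1 = torch.randn(1, 3, 128, 128)
z_t = latent_enc(encoder(frame_0), encoder(frame_1))
frame_t = decoder(z_t)  # reconstructed intermediate frame, same size as inputs
```

Because every module is convolutional, such a pipeline is agnostic to the spatial size and modality of the input frames, which is consistent with the claim that the method extends to 360°, NIR, and depth videos.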
Highlights
Recent advances in deep learning have significantly improved the performance of video interpolation, and state-of-the-art methods show promising results on benchmark datasets [1], [2]
We split previous studies into two groups: the first explicitly utilizes optical flow as guiding information, while the second directly predicts an intermediate frame without such additional information
In this work, we propose a fully convolutional video interpolation framework that can be trained on arbitrary videos
Summary
Recent advances in deep learning have significantly improved the performance of video interpolation, and state-of-the-art methods show promising results on benchmark datasets [1], [2]. This technology can be applied to various applications, including frame rate up-conversion [3], video compression [4], view synthesis [5], [6], and motion deblurring [7]. We split previous studies into two groups: the first explicitly utilizes optical flow as guiding information, while the second directly predicts an intermediate frame without such additional information. For the sake of clarity, we refer to the first as the guided approach and the second as the direct approach.