Abstract

Video prediction has developed rapidly with the boom of deep learning. As an important part of unsupervised representation learning, it plays an important role in anomalous behavior detection, autonomous driving, video games, and other fields. However, prediction methods based on optical flow estimation are susceptible to brightness changes and camera shake, and they struggle to predict occluded objects, while prediction methods based on pixel generation have difficulty fitting ambiguous and complex scenes, which leads to blurry predictions. In this work, we propose an end-to-end video prediction framework that combines an optical flow estimation module with a pixel generation module through a learnable mask weight to predict high-fidelity videos. To further improve prediction quality, we introduce adversarial training into the framework: a frame discriminator and a sequence discriminator ensure consistency between the spatio-temporal distributions of predicted and real video frames. Results of experiments on challenging datasets demonstrate the practicability and effectiveness of the proposed framework. On the one hand, it achieves quality on par with the latest models while requiring fewer parameters and predicting faster. On the other hand, ablation experiments demonstrate the effect of fusing the different modules and the effectiveness of adversarial training.
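The fusion described above can be sketched as a per-pixel convex combination of the two module outputs, weighted by the learnable mask. The helper below is a minimal NumPy illustration under assumed conventions (mask values in [0, 1], with 1 favoring the flow-warped frame); the paper's exact fusion details may differ.

```python
import numpy as np

def fuse_predictions(flow_pred, pixel_pred, mask):
    """Blend the optical-flow-warped frame and the pixel-generated frame.

    mask is a per-pixel weight in [0, 1]: 1 takes the flow-based
    prediction, 0 takes the generated prediction, values in between
    interpolate. (Hypothetical helper for illustration only.)
    """
    mask = np.clip(mask, 0.0, 1.0)
    return mask * flow_pred + (1.0 - mask) * pixel_pred

# Toy 2x2 single-channel frames: flow module predicts all ones,
# pixel module predicts all zeros, mask varies per pixel.
flow_pred = np.full((2, 2), 1.0)
pixel_pred = np.full((2, 2), 0.0)
mask = np.array([[1.0, 0.5],
                 [0.0, 0.25]])
fused = fuse_predictions(flow_pred, pixel_pred, mask)
print(fused)
```

In a trained network the mask would be produced by a learned layer (e.g. a sigmoid-activated convolution), so gradients flow to both modules and the model learns where each branch is more reliable.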

Highlights


  • To compare the performance of the proposed model with related works, three types of representative models of different video prediction methods are used for comparison: (1) models based on pixel generation: BeyondMSE [11], PredNet [8], CycleGAN [13], ContextVP [9]

  • The fusion of the optical flow estimation module and the pixel generation module can greatly improve the prediction effect, which verifies that the weighted fusion of two different modules with a learnable mask can better complement each other


Summary

A Video Prediction Method Based on Optical Flow Estimation and Pixel Generation

This work was supported in part by the Science Foundation of The China (Xi'an) Institute for Silk Road Research (2019YA07 and 2019YB05), the National Statistical Science Research Project (2016LY59), and in part by the Research Foundation of Xi'an University of Finance and Economics under Grant 18FCJH02.

INTRODUCTION
RELATED WORK
PIXEL GENERATION
FUSION OF OPTICAL FLOW AND PIXEL GENERATION
ADVERSARIAL TRAINING
EXPERIMENTS AND ANALYSIS
EVALUATION METRICS

