Abstract

With the continued development of deep learning, video frame prediction has become a hotspot in computer vision because of its wide range of applications in anomaly detection, robot decision-making, weather forecasting, and autonomous driving. Although current video frame prediction methods have made remarkable progress, most of them generate predicted frames directly by extracting latent spatial distribution patterns from the video data. They lack explicit spatiotemporal modeling, which leads to high latency, blurriness, and unrealistic results. In this work, we propose an end-to-end video prediction network (Generative Differential-Assisted Discriminative Network, abbreviated as GDDNet). It combines a difference generation method, which extracts short-term variations between frames, with attention mechanisms that recall global contextual motion information. Furthermore, a differential attention mechanism (DAM) module guides the model to allocate attention resources more efficiently. These strategies considerably improve the model's ability to represent motion features in video frames. To further improve prediction quality, we introduce adversarial training to enhance the clarity and realism of the predicted frames, together with a sequential frame discriminator that keeps the spatiotemporal distribution of predicted frames consistent with that of real frames. Experimental results on the KITTI, UCF-101, and Caltech pedestrian datasets demonstrate the effectiveness of GDDNet against state-of-the-art models. Multi-frame prediction and ablation experiments show that the proposed model not only improves prediction quality but also provides a more flexible prediction framework.
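The abstract only names the components; as a purely illustrative reading of the core idea, the PyTorch sketch below shows one way a frame-difference signal could gate spatial attention. The class name `DifferentialAttention`, the 1x1-convolution gating, and the residual wiring are assumptions for illustration, not the paper's published implementation of DAM.

```python
# Hypothetical sketch (not the paper's code): use short-term frame
# differences to decide where attention should be allocated.
import torch
import torch.nn as nn


class DifferentialAttention(nn.Module):
    """Weights current-frame features by a map derived from frame differences."""

    def __init__(self, channels: int):
        super().__init__()
        # 1x1 convolution turns the difference map into per-pixel attention logits.
        self.to_logits = nn.Conv2d(channels, 1, kernel_size=1)

    def forward(self, prev_frame: torch.Tensor, curr_frame: torch.Tensor) -> torch.Tensor:
        # Short-term variation between consecutive frames (difference generation).
        diff = curr_frame - prev_frame                # (B, C, H, W)
        attn = torch.sigmoid(self.to_logits(diff))    # (B, 1, H, W), in [0, 1]
        # Emphasize regions where motion occurred; the residual term keeps static content.
        return curr_frame * attn + curr_frame


if __name__ == "__main__":
    x_prev = torch.randn(2, 3, 64, 64)
    x_curr = torch.randn(2, 3, 64, 64)
    out = DifferentialAttention(channels=3)(x_prev, x_curr)
    print(out.shape)  # torch.Size([2, 3, 64, 64])
```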
