Abstract

This paper presents a solution to the video prediction problem based on the deep learning paradigm. Predicted frames generated by existing video prediction models are often blurry and have difficulty maintaining accuracy in multi-step prediction. To overcome these limitations, this paper presents a deep learning model, named the deep pixel restoration loop coding network (DPR-LC-Net), which employs the concept of predictive coding and adopts pixels in the real frames. While making a long-term prediction, it can generate clear prediction frames with few errors. The calculation process of DPR-LC-Net is multi-sequential: it completes calculations in the form of an approximate loop from top to bottom and then from left to right. After predicting subsequent steps, DPR-LC-Net both calculates errors and removes them from sequential prediction. Finally, the model includes a unique pixel restoration module that works efficiently on pixels in the preceding real frames to generate predicted frames, thereby improving the clarity of the predicted frames. Extensive experiments using four video datasets illustrate that the prediction performance of DPR-LC-Net is superior to that of the state-of-the-art models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.