Deep Learning in Latent Space for Video Prediction and Compression

Bowen Liu,Shiyu Liu,Hun-Seok Kim,Yu Chen

doi:10.1109/cvpr46437.2021.00076

Abstract

Learning-based video compression has achieved substantial progress during recent years. The most influential approaches adopt deep neural networks (DNNs) to remove spatial and temporal redundancies by finding the appropriate lower-dimensional representations of frames in the video. We propose a novel DNN based framework that predicts and compresses video sequences in the latent vector space. The proposed method first learns the efficient lower-dimensional latent space representation of each video frame and then performs inter-frame prediction in that latent domain. The proposed latent domain compression of individual frames is obtained by a deep autoencoder trained with a generative adversarial network (GAN). To exploit the temporal correlation within the video frame sequence, we employ a convolutional long short-term memory (ConvLSTM) network to predict the latent vector representation of the future frame. We demonstrate our method with two applications; video compression and abnormal event detection that share the identical latent frame prediction network. The proposed method exhibits superior or competitive performance compared to the state-of-the-art algorithms specifically designed for either video compression or anomaly detection. <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup>

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Learning in Latent Space for Video Prediction and Compression

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Dual distribution matching GAN
Zhiwen Zuo ... Dongming Lu
Neurocomputing | VOL. 478
Zhiwen Zuo, et. al.Zhiwen Zuo ... Dongming Lu
03 Jan 2022
Neurocomputing | VOL. 478

Express Construction for GANs from Latent Representation to Data Distribution
Minghui Liu ... Ming Liu
Applied Sciences | VOL. 12
Minghui Liu, et. al.Minghui Liu ... Ming Liu
13 Apr 2022
Applied Sciences | VOL. 12

A Generative Neural Network for Maximizing Fitness and Diversity of Synthetic DNA and Protein Sequences.
Johannes Linder ... Georg Seelig
Cell Systems | VOL. 11
Johannes Linder, et. al.Johannes Linder ... Georg Seelig
25 Jun 2020
Cell Systems | VOL. 11

Using deep LSD to build operators in GANs latent space with meaning in real space.
J Quetzalcóatl Toledo-Marín ... James A Glazier
PloS one | VOL. 18
J Quetzalcóatl Toledo-Marín, et. al.J Quetzalcóatl Toledo-Marín ... James A Glazier
29 Jun 2023
PloS one | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Learning in Latent Space for Video Prediction and Compression

Abstract

Talk to us

Similar Papers