DeepBinaryMask: Learning a binary mask for video compressive sensing

Michael Iliadis,Leonidas Spinoulas,Aggelos K Katsaggelos

doi:10.1016/j.dsp.2019.102591

Abstract

In this paper, we propose an encoder-decoder neural network model referred to as DeepBinaryMask for video compressive sensing. In video compressive sensing one frame is acquired using a set of coded masks (sensing matrix) from which a number of video frames, equal to the number of coded masks, is reconstructed. The proposed framework is an end-to-end model where the sensing matrix is trained along with the video reconstruction. The encoder maps a video block to compressive measurements by learning the binary elements of the sensing matrix. The decoder is trained to map the measurements from a video patch back to a video block via several hidden layers of a Multi-Layer Perceptron network. The predicted video blocks are stacked together to recover the unknown video sequence. The reconstruction performance is found to improve when using the trained sensing mask from the network as compared to other mask designs such as random, across a wide variety of compressive sensing reconstruction algorithms. Finally, our analysis and discussion offers insights into understanding the characteristics of the trained mask designs that lead to the improved reconstruction quality.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DeepBinaryMask: Learning a binary mask for video compressive sensing

Abstract

Talk to us

Similar Papers

More From: Digital Signal Processing

Lead the way for us

Journal: Digital Signal Processing	Publication Date: Oct 7, 2019
Citations: 62

Similar Papers

Continuous Wavelet Transform Based Gene Optimized Fuzzy C-Means Clustering For Forest Fire Detection
-
International Journal of Innovative Technology and Exploring Engineering | VOL. 8
--
05 Sep 2019
International Journal of Innovative Technology and Exploring Engineering | VOL. 8

Morlet Wavelet Threshold Based Glow Warm Optimized X-Means Clustering for Forest Fire Detection
B Pushpa ... M Kamarasan
-
B Pushpa, et. al.B Pushpa ... M Kamarasan
01 Jan 2020
01 Jan 2020

Variable Temporal Length Training for Action Recognition CNNs.
Tan-Kun Li ... Tardi Tjahjadi
Sensors (Basel, Switzerland) | VOL. 24
Tan-Kun Li, et. al.Tan-Kun Li ... Tardi Tjahjadi
25 May 2024
Sensors (Basel, Switzerland) | VOL. 24

Three-dimensional video compression using subband/wavelet transform with lower buffering requirements
H Khalil ... A.F Atiya
IEEE Transactions on Image Processing | VOL. 8
H Khalil, et. al.H Khalil ... A.F Atiya
01 Jun 1999
IEEE Transactions on Image Processing | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DeepBinaryMask: Learning a binary mask for video compressive sensing

Abstract

Talk to us

Similar Papers

More From: Digital Signal Processing