Abstract

Sequential data such as video are characterized by spatio-temporal redundancies. To date, few deep learning algorithms exploit these redundancies to reduce the often massive cost of inference. This work leverages correlations in video data to reduce the size and run-time cost of deep neural networks. Drawing on the simplicity of the commonly used ReLU activation function, we replace it with dynamically updating masks. The resulting network is a simple chain of matrix multiplications and bias additions, which can be contracted into a single weight matrix and bias vector. Inference then reduces to an affine transformation of the input sample with these contracted parameters. We show that the method is akin to approximating the neural network with a first-order Taylor expansion around a dynamically updating reference point. For triggering these updates, one static and three data-driven mechanisms are analyzed. We evaluate the proposed algorithm on a range of tasks, including pose estimation on surveillance data, road detection on KITTI driving scenes, object detection on ImageNet videos, as well as denoising MNIST digits, and obtain compression rates up to <inline-formula> <tex-math notation="LaTeX">$3.6\times $ </tex-math></inline-formula>.
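The core idea of the abstract, replacing ReLU with a fixed mask so the network collapses into a single affine map, can be sketched in NumPy. This is a minimal illustration, not the paper's implementation: the two-layer network, the layer names, and the use of a single reference input to freeze the activation pattern are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical two-layer ReLU network: names W1, b1, W2, b2 are illustrative.
W1, b1 = rng.standard_normal((8, 4)), rng.standard_normal(8)
W2, b2 = rng.standard_normal((3, 8)), rng.standard_normal(3)

def relu_net(x):
    return W2 @ np.maximum(W1 @ x + b1, 0.0) + b2

# Freeze the ReLU pattern at a reference input x_ref: the binary mask m
# records which units are active, turning ReLU into a fixed multiplication.
x_ref = rng.standard_normal(4)
m = (W1 @ x_ref + b1 > 0).astype(float)

# Contract the resulting chain of affine maps into one weight/bias pair.
W = W2 @ (m[:, None] * W1)
b = W2 @ (m * b1) + b2

# Wherever the frozen activation pattern matches the true one (here, at the
# reference point itself), the contracted affine map reproduces the network.
assert np.allclose(relu_net(x_ref), W @ x_ref + b)
```

Inference on subsequent, highly correlated inputs then costs only one matrix multiply and one bias addition, and the mask is refreshed whenever an update mechanism fires.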
