FrameExit: Conditional Early Exiting for Efficient Video Recognition

Amir Ghodrati, Amirhossein Habibian, Babak Ehteshami Bejnordi

doi:10.1109/cvpr46437.2021.01535

Abstract

In this paper, we propose a conditional early exiting framework for efficient video recognition. While existing works focus on selecting a subset of salient frames to reduce the computation costs, we propose to use a simple sampling strategy combined with conditional early exiting to enable efficient recognition. Our model automatically learns to process fewer frames for simpler videos and more frames for complex ones. To achieve this, we employ a cascade of gating modules to automatically determine the earliest point in processing where an inference is sufficiently reliable. We generate on-the-fly supervision signals to the gates to provide a dynamic trade-off between accuracy and computational cost. Our proposed model outperforms competing methods on three large-scale video benchmarks. In particular, on ActivityNet1.3 and mini-kinetics, we outperform the state-of-the-art efficient video recognition methods with 1.3× and 2.1× less GFLOPs, respectively. Additionally, our method sets a new state of the art for efficient video understanding on the HVU benchmark.
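
The early-exit idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper's gates are learned modules, whereas the gate below is a hand-set confidence threshold used as a stand-in. The function name `early_exit_predict`, the `threshold` parameter, and the running-mean aggregation of per-frame logits are all assumptions made for illustration.

```python
import math
from typing import List, Sequence, Tuple


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def early_exit_predict(
    frame_logits: Sequence[Sequence[float]],
    threshold: float = 0.9,  # hypothetical hand-set gate, not a learned one
) -> Tuple[int, int]:
    """Process frames in order and stop at the first confident prediction.

    frame_logits holds one vector of class logits per sampled frame.
    Returns (predicted_class, number_of_frames_processed).
    """
    if not frame_logits:
        raise ValueError("at least one frame is required")

    running_sum = [0.0] * len(frame_logits[0])
    for t, logits in enumerate(frame_logits, start=1):
        # Aggregate evidence from all frames seen so far (running mean).
        running_sum = [s + x for s, x in zip(running_sum, logits)]
        probs = softmax([s / t for s in running_sum])
        confidence = max(probs)
        # Stand-in gate: exit as soon as the prediction looks reliable.
        if confidence >= threshold:
            break
    # If no gate fired, the loop ends after the last frame.
    return probs.index(confidence), t


# A "simple" video is decided after one frame; an ambiguous one uses all frames.
easy = [[5.0, 0.0, 0.0]] * 4
hard = [[0.5, 0.4, 0.0], [0.2, 0.6, 0.0], [0.0, 3.0, 0.0]]
easy_result = early_exit_predict(easy)
hard_result = early_exit_predict(hard)
```

In this sketch the cost saving comes purely from skipping the remaining frames once the gate fires; raising or lowering `threshold` trades accuracy against the number of frames processed.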

Similar Papers

HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition
Zejia Weng ... Yu-Gang Jiang
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 20
27 Sep 2023

A Dynamic Frame Selection Framework for Fast Video Recognition.
Zuxuan Wu ... Hengduo Li
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44
07 Oct 2020

AdaFrame: Adaptive Frame Selection for Fast Video Recognition
Zuxuan Wu ... Chih-Yao Ma
01 Jun 2019

Deformable Video Transformer
Jue Wang ... Lorenzo Torresani
01 Jun 2022
