Abstract

A key factor that distinguishes action detection in videos from general video classification is the presence of human-guided clues, especially motion signals. Since not all pixels in a video are informative for action recognition, the irrelevant and redundant regions introduce considerable noise and burden both feature extraction and classifier training. This motivates researchers to design attentive models that dynamically focus computation on the key spatiotemporal volumes. In this paper, we propose a motion-centric attention model for action detection in videos that imitates the saccade and fixation procedures of human perception. Specifically, we first present a strategy to generate motion-centric locations based on the density peaks of motion signals, providing reliable candidates around which actions are likely to occur. We then introduce an attention model that conducts saccade and fixation over these candidates to observe local spatiotemporal visual information, preserve an internal comprehension of the video, and produce action proposals with temporal bounds. Afterward, a classifier with several variants classifies the action proposals and decides which one to fixate on, generating the final predictions. We show how to efficiently train our model to produce fast and accurate action detection by scanning only a small fraction of locations in a video. Extensive experiments on three challenging datasets show promising results in terms of both accuracy and speed.
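
For intuition, the sketch below shows one plausible way to obtain motion-centric candidate locations as density peaks of accumulated optical-flow magnitude over a short clip. The function name, parameters, and choice of Farneback optical flow are illustrative assumptions for this sketch, not the authors' exact procedure.

```python
# Illustrative sketch (assumed details, not the paper's exact algorithm):
# candidate locations are taken as density peaks of accumulated motion magnitude.
import numpy as np
import cv2
from scipy.ndimage import gaussian_filter, maximum_filter

def motion_centric_candidates(frames, sigma=9.0, num_peaks=5):
    """frames: list of grayscale uint8 frames from one video clip."""
    h, w = frames[0].shape
    motion = np.zeros((h, w), dtype=np.float32)
    for prev, nxt in zip(frames[:-1], frames[1:]):
        # Dense optical flow between consecutive frames (Farneback, assumed here).
        flow = cv2.calcOpticalFlowFarneback(
            prev, nxt, None, 0.5, 3, 15, 3, 5, 1.2, 0)
        motion += np.linalg.norm(flow, axis=2)   # accumulate motion magnitude
    density = gaussian_filter(motion, sigma)      # smooth into a density map
    # Density peaks: local maxima of the smoothed motion map.
    peaks = (density == maximum_filter(density, size=25)) & (density > 0)
    ys, xs = np.nonzero(peaks)
    order = np.argsort(density[ys, xs])[::-1][:num_peaks]
    return [(int(xs[i]), int(ys[i])) for i in order]  # (x, y) candidates
```

In such a scheme, the attention model would only need to visit these few candidate locations rather than the full frame, which is consistent with the abstract's claim of scanning only a small fraction of locations per video.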
