Abstract
Localizing and interpreting human actions in videos requires understanding the spatial and temporal context of the scenes. Beyond accurate detection, many real-world sensing scenarios also demand incremental, instantaneous processing of scenes under restricted computational budgets. However, state-of-the-art detectors fail to meet these criteria. The main challenge lies in the heavy architectures and detection pipelines they employ to reason about pertinent spatiotemporal information, such as 3D Convolutional Neural Networks (CNNs) or optical flow extraction. With this insight, we propose a lightweight action tubelet detector, coined TEDdet, which unifies complementary feature aggregation and motion modeling modules. Specifically, our Temporal Feature Exchange module induces feature interaction by adaptively aggregating 2D CNN features over successive frames. To address actors' location shift across the sequence, our Temporal Feature Difference module accumulates approximated pair-wise motion among target frames as trajectory cues. These modules can be easily integrated with an existing anchor-free detector to cooperatively model action instances' categories, sizes, and movement for precise tubelet generation. TEDdet exploits larger temporal strides to efficiently infer actions in a coarse-to-fine and online manner. Without relying on 3D CNNs or optical flow models, our detector demonstrates competitive accuracy at an unprecedented speed (89 FPS) that is more compliant with realistic applications. Code will be available at https://github.com/alphadadajuju/TEDdet.
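To make the two modules concrete, the sketch below gives a minimal PyTorch rendering of the ideas described above: adaptive aggregation of per-frame 2D CNN features (Temporal Feature Exchange) and accumulation of pair-wise feature differences as coarse trajectory cues (Temporal Feature Difference). It assumes per-frame features of shape (B, T, C, H, W); module names, the softmax-weighted aggregation, and all hyperparameters are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch (not the released TEDdet code): minimal versions of the
# two temporal modules, assuming stacked per-frame 2D CNN features (B, T, C, H, W).
import torch
import torch.nn as nn


class TemporalFeatureExchange(nn.Module):
    """Adaptively aggregates 2D CNN features over T successive frames."""

    def __init__(self, channels: int, num_frames: int):
        super().__init__()
        # Learned per-channel weights over the temporal axis: one plausible
        # form of "adaptive aggregation" (assumption).
        self.temporal_weights = nn.Parameter(torch.ones(num_frames, channels))

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (B, T, C, H, W) -> aggregated feature (B, C, H, W)
        w = torch.softmax(self.temporal_weights, dim=0)          # (T, C)
        return (feats * w[None, :, :, None, None]).sum(dim=1)


class TemporalFeatureDifference(nn.Module):
    """Accumulates pair-wise feature differences as approximate motion cues."""

    def __init__(self, channels: int):
        super().__init__()
        self.smooth = nn.Conv2d(channels, channels, kernel_size=3, padding=1)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        # feats: (B, T, C, H, W) -> accumulated trajectory cue (B, C, H, W)
        diffs = feats[:, 1:] - feats[:, :-1]                     # (B, T-1, C, H, W)
        return self.smooth(diffs.sum(dim=1))


if __name__ == "__main__":
    B, T, C, H, W = 2, 5, 64, 18, 18
    feats = torch.randn(B, T, C, H, W)
    fused = TemporalFeatureExchange(C, T)(feats)
    motion = TemporalFeatureDifference(C)(feats)
    print(fused.shape, motion.shape)  # both: torch.Size([2, 64, 18, 18])
```

In an anchor-free detection head, outputs of this kind could feed the branches that predict action category heatmaps, box sizes, and inter-frame movement offsets for tubelet linking.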