Deep Motion Prior for Weakly-Supervised Temporal Action Localization.

Meng Cao, Mike Zheng Shou, Long Chen, Yuexian Zou, Can Zhang

doi:10.1109/tip.2022.3193752

Abstract

Weakly-Supervised Temporal Action Localization (WSTAL) aims to localize actions in untrimmed videos using only video-level labels. Most state-of-the-art WSTAL methods follow a Multi-Instance Learning (MIL) pipeline: they first produce snippet-level predictions and then aggregate them into a video-level prediction. However, we argue that existing methods overlook two important drawbacks: 1) inadequate use of motion information and 2) the incompatibility of the prevailing cross-entropy training loss. In this paper, we show that the motion cues carried by optical flow features provide complementary information. Inspired by this, we propose a context-dependent motion prior, termed motionness. Specifically, a motion graph is introduced to model motionness based on a local motion carrier (e.g., optical flow). In addition, to highlight more informative video snippets, a motion-guided loss is proposed to modulate network training conditioned on motionness scores. Extensive ablation studies confirm that motionness effectively models the action of interest and that the motion-guided loss leads to more accurate results. Moreover, the motion-guided loss is plug-and-play and can be applied to existing WSTAL methods. Without loss of generality, we build on the standard MIL pipeline, and our method achieves new state-of-the-art performance on three challenging benchmarks: THUMOS'14, ActivityNet v1.2, and ActivityNet v1.3.
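
The MIL pipeline mentioned in the abstract (snippet-level predictions aggregated into a video-level prediction, trained with cross-entropy against the video label) can be illustrated with a minimal sketch. The abstract does not specify the aggregation rule, so top-k mean pooling is used here as an assumption; the function names, the value of k, and the toy logits are all hypothetical. The sketch does not implement motionness or the motion-guided loss, whose formulation is not given in the abstract.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def video_level_probs(snippet_logits, k):
    """Aggregate snippet-level class logits (T snippets x C classes) into
    video-level class probabilities.

    Assumption: top-k mean pooling per class, a common MIL aggregation
    choice; the abstract does not state which rule the authors use.
    """
    num_classes = len(snippet_logits[0])
    pooled = []
    for c in range(num_classes):
        # Sort this class's snippet scores and average the k largest.
        column = sorted((row[c] for row in snippet_logits), reverse=True)
        top = column[:k]
        pooled.append(sum(top) / len(top))
    return softmax(pooled)

def video_cross_entropy(probs, label):
    """Cross-entropy against a single video-level class label."""
    return -math.log(probs[label])

# Toy input: 3 snippets, 2 classes (hypothetical values).
snippet_logits = [[2.0, 0.0], [1.0, 0.0], [-3.0, 0.0]]
probs = video_level_probs(snippet_logits, k=2)
loss = video_cross_entropy(probs, label=0)
print(probs, loss)
```

Only the video-level label enters the loss, which is why per-snippet supervision is unavailable in this setting and why the choice of which snippets contribute to the pooled score matters.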

Journal: IEEE Transactions on Image Processing
Publication Date: Jan 1, 2022
Citations: 14

Similar Papers

CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
Can Zhang ... Yuexian Zou
01 Jun 2021

Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization
Jianxiong Zhou ... Ying Wu
01 Jan 2023

Discriminative Action Snippet Propagation Network for Weakly Supervised Temporal Action Localization
Yuanjie Dang ... Nan Gao
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 20
08 Mar 2024

Dual-Feature Enhancement for Weakly Supervised Temporal Action Localization
Siying Liu ... Bin Liu
04 Jun 2023
