Discriminative Action Snippet Propagation Network for Weakly Supervised Temporal Action Localization

Yuanjie Dang,Chunxia Huang,Ronghua Liang,Ruohong Huan,Dongdong Zhao,Peng Chen,Nan Gao

doi:10.1145/3643815

Abstract

Weakly supervised temporal action localization (WTAL) aims to classify and localize actions in untrimmed videos with only video-level labels. Recent studies have attempted to obtain more accurate temporal boundaries by exploiting latent action instances in ambiguous snippets or propagating representative action features. However, empirically handcrafted ambiguous snippet extraction and the imprecise alignment of representative snippet propagation lead to challenges in modeling the completeness of actions for these methods. In this article, we propose a Discriminative Action Snippet Propagation Network (DASP-Net) to accurately discover ambiguous snippets in videos and propagate discriminative instance-level features throughout the video for improving action completeness. Specifically, we introduce a novel discriminative feature propagation module for capturing the global contextual attention and propagating the action concept across the whole video by perceiving the discriminative action snippets with instance information from the same video. Simultaneously, we incorporate denoised pseudo-labels as supervision, where we correct the controversial prediction based on the feature space distribution during training, thereby alleviating false detection caused by noise background features. Furthermore, we design an ambiguous feature mining module, which maximizes the feature affinity information of action and background in ambiguous snippets to generate more accurate latent action and background snippets and learns more precise action instance boundaries through contrastive learning of action and background snippets. Extensive experiments show that DASP-Net achieves state-of-the-art results on THUMOS14 and ActivityNet1.2 datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discriminative Action Snippet Propagation Network for Weakly Supervised Temporal Action Localization

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications

Lead the way for us

Similar Papers

Dual-Feature Enhancement for Weakly Supervised Temporal Action Localization
Siying Liu ... Bin Liu
-
Siying Liu, et. al.Siying Liu ... Bin Liu
04 Jun 2023
04 Jun 2023

CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
Can Zhang ... Yuexian Zou
-
Can Zhang, et. al.Can Zhang ... Yuexian Zou
01 Jun 2021
01 Jun 2021

Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks.
Ziyi Liu ... Le Wang
IEEE transactions on pattern analysis and machine intelligence | VOL. 44
Ziyi Liu, et. al.Ziyi Liu ... Le Wang
01 Jan 2020
IEEE transactions on pattern analysis and machine intelligence | VOL. 44

Enabling Weakly Supervised Temporal Action Localization From On-Device Learning of the Video Stream
Yue Tang ... Peipei Zhou
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 41
Yue Tang, et. al.Yue Tang ... Peipei Zhou
01 Nov 2022
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 41

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discriminative Action Snippet Propagation Network for Weakly Supervised Temporal Action Localization

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications