Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization

Peng Dou,Haifeng Hu,Ying Zeng,Zhuoqun Wang

doi:10.1145/3567828

Abstract

Recent action localization works learn in a weakly supervised manner to avoid the expensive cost of human labeling. Those works are mostly based on the Multiple Instance Learning framework, where temporal pooling is an indispensable part that usually relies on the guidance of snippet-level Class Activation Sequences (CAS) . However, we observe that previous works only leverage a simple convolutional neural network for the generation of CAS, which ignores the weak discriminative foreground action segments and the background ones, and meanwhile, the relationship between different actions has not been considered. To solve this problem, we propose multiple temporal pooling mechanisms (MTP) for a more sufficient information utilization. Specifically, with the design of the Foreground Variance Branch, Dual Foreground Attention Branch and Hybrid Attention Fine-tuning Branch, MTP can leverage more effective information from different aspects and generate different CASs to guide the learning of temporal pooling. Moreover, different loss functions are designed for a better optimization of individual branches, aiming to effectively distinguish the action from the background. Our method shows excellent results on the THUMOS14 and ActivityNet1.2 datasets.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications

Lead the way for us

Journal: ACM Transactions on Multimedia Computing, Communications, and Applications	Publication Date: Feb 25, 2023
Citations: 2

Similar Papers

Multiple instance learning framework can facilitate explainability in murmur detection.
Maurice Rohr ... Dukyong Yoon
PLOS Digital Health | VOL. 3
Maurice Rohr, et. al.Maurice Rohr ... Dukyong Yoon
19 Mar 2024
PLOS Digital Health | VOL. 3

A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment
Zhengzhong Tu ... Alan C Bovik
-
Zhengzhong Tu, et. al.Zhengzhong Tu ... Alan C Bovik
01 Oct 2020
01 Oct 2020

Weakly Supervised Action Localization by Sparse Temporal Pooling Network
Phuc Nguyen ... Gautam Prasad
-
Phuc Nguyen, et. al.Phuc Nguyen ... Gautam Prasad
01 Jun 2018
01 Jun 2018

Brick Assembly Networks: An Effective Network for Incremental Learning Problems
Jiacang Ho ... Dae-Ki Kang
Electronics | VOL. 9
Jiacang Ho, et. al.Jiacang Ho ... Dae-Ki Kang
17 Nov 2020
Electronics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications