Abstract

Weakly-supervised Temporal Action Localization (WTAL) aims to localize actions in untrimmed videos using only video-level labels. Most existing methods adopt a "localization by classification" paradigm and extract features with a model pre-trained on a recognition task. The gap between the recognition and localization tasks leads to inferior performance. Some recent works apply feature enhancement to obtain better features for localization and boost performance to some extent; however, they exploit only intra-video information while ignoring meaningful inter-video information in the dataset. In this paper, we propose a novel Dual-Feature Enhancement (DFE) method for WTAL that utilizes both intra- and inter-video information. For intra-video information, a local feature enhancement module is designed to promote feature interaction along the temporal dimension within each video. For inter-video information, a global memory module is first designed to learn representations for different categories across different videos; a global feature enhancement module then enhances the video features with the help of these global representations in the memory. Moreover, to avoid the extra computational cost of the global enhancement module at inference, a distillation loss enforces the local branch to learn from the global branch, so that the global enhancement module can be removed during inference. The proposed method achieves state-of-the-art performance on popular benchmarks.
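The dual-branch idea described above can be sketched in a few lines of PyTorch. This is a minimal illustration, not the authors' implementation: all module names, the choice of a temporal convolution for the local branch, attention over a learnable class-wise memory for the global branch, and an MSE distillation loss are assumptions made for the example.

```python
import torch
import torch.nn as nn

class LocalEnhance(nn.Module):
    """Intra-video branch (hypothetical): a temporal 1-D convolution
    that lets snippet features interact along the time axis."""
    def __init__(self, dim):
        super().__init__()
        self.conv = nn.Conv1d(dim, dim, kernel_size=3, padding=1)

    def forward(self, x):                      # x: (B, T, D)
        return self.conv(x.transpose(1, 2)).transpose(1, 2)

class GlobalEnhance(nn.Module):
    """Inter-video branch (hypothetical): snippet features attend to a
    learnable memory of per-category representations shared across videos."""
    def __init__(self, dim, num_classes):
        super().__init__()
        self.memory = nn.Parameter(torch.randn(num_classes, dim))

    def forward(self, x):                      # x: (B, T, D)
        attn = torch.softmax(x @ self.memory.t(), dim=-1)   # (B, T, C)
        return x + attn @ self.memory                       # (B, T, D)

B, T, D, C = 2, 16, 64, 20                     # toy sizes, not from the paper
feats = torch.randn(B, T, D)                   # pre-extracted snippet features
local_branch = LocalEnhance(D)
global_branch = GlobalEnhance(D, C)

f_local = local_branch(feats)
f_global = global_branch(feats)

# Distillation: push the local branch toward the (detached) global branch,
# so only the local branch is needed at inference time.
distill_loss = nn.functional.mse_loss(f_local, f_global.detach())
```

During training, `distill_loss` would be added to the usual classification objective; at inference, only `local_branch` is run, which is how the extra cost of the memory-based global module is avoided.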
