Learning from Weakly-Labeled Web Videos via Exploring Sub-concepts

Kunpeng Li,Tomas Pfister,Yun Fu,Xuehan Xiong,Guanhang Wu,Chen-Yu Lee,Zizhao Zhang,Zhichao Lu

doi:10.1609/aaai.v36i2.20022

Abstract

Learning visual knowledge from massive weakly-labeled web videos has attracted growing research interests thanks to the large corpus of easily accessible video data on the Internet. However, for video action recognition, the action of interest might only exist in arbitrary clips of untrimmed web videos, resulting in high label noises in the temporal space. To address this challenge, we introduce a new method for pre-training video action recognition models using queried web videos. Instead of trying to filter out potential noises, we propose to provide fine-grained supervision signals by defining the concept of Sub-Pseudo Label (SPL). Specifically, SPL spans out a new set of meaningful "middle ground" label space constructed by extrapolating the original weak labels during video querying and the prior knowledge distilled from a teacher model. Consequently, SPL provides enriched supervision for video models to learn better representations and improves data utilization efficiency of untrimmed videos. We validate the effectiveness of our method on four video action recognition datasets and a weakly-labeled image dataset. Experiments show that SPL outperforms several existing pre-training strategies and the learned representations lead to competitive results on several benchmarks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning from Weakly-Labeled Web Videos via Exploring Sub-concepts

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 28, 2022
Citations: 3

Similar Papers

Action Recognition in Untrimmed Videos with Composite Self-attention Two-Stream Framework
Dong Cao ... Haibo Chen
-
Dong Cao, et. al.Dong Cao ... Haibo Chen
01 Jan 2020
01 Jan 2020

Learning Transferable Self-Attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision
Xiao-Yu Zhang ... Changsheng Li
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33
Xiao-Yu Zhang, et. al.Xiao-Yu Zhang ... Changsheng Li
17 Jul 2019
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 33

Audio and Video Feature Fusion for Activity Recognition in Unconstrained Videos
José Lopes ... Sameer Singh
-
José Lopes, et. al.José Lopes ... Sameer Singh
01 Jan 2006
01 Jan 2006

Localizing the Common Action Among a Few Videos
Pengwan Yang ... Cees G M Snoek
-
Pengwan Yang, et. al.Pengwan Yang ... Cees G M Snoek
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning from Weakly-Labeled Web Videos via Exploring Sub-concepts

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence