Temporal teacher with masked transformers for semi-supervised action proposal generation

Selen Pehlivan, Jorma Laaksonen

doi:10.1007/s00138-024-01521-7

Abstract

By conditioning on unit-level predictions, anchor-free models for action proposal generation have displayed impressive capabilities, including a lightweight architecture. However, task performance depends significantly on the quality of the training data, and the most effective models have relied on human-annotated data. Semi-supervised learning, i.e., jointly training deep neural networks on a labeled dataset and an unlabeled dataset, has made significant progress recently. Existing works have either focused primarily on classification tasks, which may require less annotation effort, or considered anchor-based detection models. Inspired by recent advances in semi-supervised methods for anchor-free object detectors, we propose a teacher-student framework for a two-stage action detection pipeline, named Temporal Teacher with Masked Transformers (TTMT), to generate high-quality action proposals based on an anchor-free transformer model. Leveraging consistency learning as a self-training technique, the framework jointly trains an anchor-free student model and a gradually progressing teacher counterpart in a mutually beneficial manner. As the core model, we design a Transformer-based anchor-free model to improve effectiveness for temporal evaluation. We integrate bi-directional masks and devise encoder-only Masked Transformers for sequences. Trained jointly on boundary locations and various local snippet-based features, the model uses the proposed scoring function to generate proposal candidates. Experiments on the THUMOS14 and ActivityNet-1.3 benchmarks demonstrate the effectiveness of our model for the temporal proposal generation task.
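The abstract does not spell out how the teacher "gradually progresses" or how the bi-directional masks are built. The sketch below illustrates one common reading, under two stated assumptions: the teacher weights follow an exponential moving average (EMA) of the student weights, as in many teacher-student methods, and the bi-directional masks are a pair of forward (past-only) and backward (future-only) attention masks. The names `ema_update`, `directional_masks`, and `decay` are hypothetical and do not come from the paper.

```python
def ema_update(teacher, student, decay=0.99):
    """Move each teacher weight a small step toward the matching student weight.

    Assumption: an EMA rule; the abstract only says the teacher
    is "gradually progressing".
    """
    return [decay * t + (1.0 - decay) * s for t, s in zip(teacher, student)]


def directional_masks(length):
    """Build (forward, backward) boolean attention masks, each length x length.

    Assumption: forward[i][j] is True when snippet i may attend to
    snippet j <= i (past context), and backward[i][j] is True when
    snippet i may attend to snippet j >= i (future context).
    """
    forward = [[j <= i for j in range(length)] for i in range(length)]
    backward = [[j >= i for j in range(length)] for i in range(length)]
    return forward, backward


if __name__ == "__main__":
    # Toy weights: the teacher drifts slowly toward the student.
    teacher = [0.0, 1.0]
    student = [1.0, 1.0]
    teacher = ema_update(teacher, student, decay=0.9)
    print(teacher)

    # Masks for a sequence of three snippets.
    fwd, bwd = directional_masks(3)
    print(fwd)
    print(bwd)
```

With a decay close to 1, the teacher changes slowly and so provides more stable targets on unlabeled videos than the student's own latest predictions would; this is the usual motivation for EMA teachers, not a claim about the paper's exact update rule.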

Journal: Machine Vision and Applications
Publication Date: Mar 15, 2024
License type: CC BY 4.0

