Abstract

Group activity recognition aims to recognize behaviors performed jointly by multiple individuals in a scene. Existing schemes rely on individual relation inference and usually treat the individuals as tokens; essentially, they select the regions most relevant to the group activity from the entire image while filtering out irrelevant background noise. However, these schemes require individual bounding-box labels in both the training and testing stages. Moreover, since individuals are usually represented at a single scale, multi-scale individual information cannot be combined effectively. In this paper, we present a novel end-to-end hierarchical relation inference framework based on active spatial positions for group activity recognition. The framework locates active spatial positions and uses them as visual tokens whose embeddings are refined by relation inference. It requires individual bounding-box labels only in the training stage and automatically eliminates the background after locating the active spatial positions in the scene. Hierarchical relations can then be naturally inferred from visual tokens at different scales, contributing to further performance improvement. Experimental results demonstrate that the proposed framework is competitive with existing schemes that require more labeling effort and computation in both the training and testing stages.
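The abstract does not specify the scoring or relation modules, so the following is only a minimal numpy sketch of the overall idea under stated assumptions: the position-scoring head is stood in for by a plain feature-norm score, and relation inference is stood in for by single-head dot-product self-attention over the selected tokens. Function names (`select_active_tokens`, `relation_inference`) are illustrative, not from the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def select_active_tokens(feat, k):
    """Score every spatial position and keep the top-k as visual tokens.

    feat: (H, W, C) backbone feature map. The L2-norm score here is a
    stand-in for the learned active-position scoring module.
    """
    H, W, C = feat.shape
    flat = feat.reshape(H * W, C)
    scores = np.linalg.norm(flat, axis=1)      # "activeness" per position
    idx = np.argsort(scores)[::-1][:k]         # top-k active positions
    return flat[idx], idx                      # tokens + their locations

def relation_inference(tokens):
    """Refine token embeddings by attending over all other tokens
    (a stand-in for the paper's relation inference)."""
    C = tokens.shape[1]
    attn = softmax(tokens @ tokens.T / np.sqrt(C), axis=-1)
    return attn @ tokens                       # relation-refined embeddings

rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 8, 16))         # toy feature map at one scale
tokens, idx = select_active_tokens(feat, k=6)  # background positions dropped
refined = relation_inference(tokens)
group_embedding = refined.mean(axis=0)         # pool tokens into a scene vector
print(tokens.shape, refined.shape, group_embedding.shape)
```

In the full framework this selection and inference would be repeated over feature maps at several scales, with relations inferred hierarchically across the resulting token sets; the sketch shows a single scale only.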
