Abstract

Pedestrian attribute recognition (PAR) aims to predict the visual attributes of a pedestrian image. PAR has been used to provide soft biometrics for visual surveillance and IoT security. Most current PAR methods are developed on discrete images. However, it is challenging for image-based methods to handle occlusion and action-related attributes in real-world applications. Recently, video-based PAR has attracted much attention, as the temporal cues in video sequences can be exploited for better recognition. Unfortunately, existing methods usually ignore the correlations among different attributes and the relations between attributes and spatial regions. To address this problem, we propose a novel method for video-based PAR that explores the relationships among different attributes in both the spatial and temporal domains. More specifically, a spatio-temporal saliency module (STSM) is introduced to capture the key visual patterns in the video sequences, and a spatio-temporal attribute relationship learning (STARL) module is proposed to mine the correlations among these patterns. Meanwhile, a large-scale benchmark for video-based PAR, RAP-Video, is built by extending the image-based dataset RAP-2; it contains 83,216 tracklets covering 25 scenes. To the best of our knowledge, this is the largest dataset for video-based PAR. Extensive experiments are performed on the proposed benchmark as well as on MARS Attribute and DukeMTMC-Video Attribute. The superior performance demonstrates the effectiveness of the proposed method.
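
The abstract only names the two modules, so the snippet below is a minimal, hypothetical sketch of how an STSM-style saliency step and a STARL-style relation step could be composed; the class names, layer choices, and the attribute count (54, as in RAP-2) are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn

class SpatioTemporalSaliency(nn.Module):
    """Hypothetical STSM-style step: re-weight each frame/location of a
    video feature map by a learned saliency score."""
    def __init__(self, dim):
        super().__init__()
        self.score = nn.Conv3d(dim, 1, kernel_size=1)  # per-location score

    def forward(self, feats):                      # feats: (B, C, T, H, W)
        attn = torch.sigmoid(self.score(feats))    # (B, 1, T, H, W)
        salient = feats * attn                     # emphasize key patterns
        return salient.flatten(2).transpose(1, 2)  # (B, T*H*W, C) tokens

class AttributeRelationLearner(nn.Module):
    """Hypothetical STARL-style step: cross-attention gathers evidence for
    each attribute query, then self-attention models inter-attribute
    correlations before per-attribute classification."""
    def __init__(self, dim, num_attrs, num_heads=4):
        super().__init__()
        self.queries = nn.Parameter(torch.randn(num_attrs, dim))
        self.cross = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.relate = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.head = nn.Linear(dim, 1)              # one logit per attribute

    def forward(self, tokens):                     # tokens: (B, N, C)
        q = self.queries.expand(tokens.size(0), -1, -1)
        attr, _ = self.cross(q, tokens, tokens)    # attribute-specific evidence
        attr, _ = self.relate(attr, attr, attr)    # attribute correlations
        return self.head(attr).squeeze(-1)         # (B, num_attrs) logits

# Toy forward pass: 2 clips, 512-dim backbone features, 8 frames, 7x7 grid.
feats = torch.randn(2, 512, 8, 7, 7)
logits = AttributeRelationLearner(512, num_attrs=54)(
    SpatioTemporalSaliency(512)(feats))
print(logits.shape)  # torch.Size([2, 54])
```

Treating attributes as learnable queries is one common way to realize the idea the abstract describes: each attribute attends to the saliency-weighted spatio-temporal tokens, and the second attention layer lets co-occurring attributes (e.g., "skirt" and "female") inform each other.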
