Abstract

Pedestrian Attribute Recognition (PAR) involves identifying the attributes of individuals in person images. Existing PAR methods typically rely on CNNs as the backbone network to extract pedestrian features. However, CNNs process only one adjacent region at a time, losing the long-range relations between different attribute-specific regions. To address this limitation, we adopt the Vision Transformer (ViT) instead of CNNs as the backbone for PAR, aiming to model long-range relations and extract more robust features. However, PAR suffers from an inherent attribute imbalance: ViT naturally focuses on attributes that appear frequently in the training set and overlooks attributes that appear rarely, and the native features extracted by ViT cannot compensate for this imbalanced attribute distribution. To tackle this issue, we propose two novel components: the Selective Feature Activation Method (SFAM) and the Orthogonal Feature Activation Loss. SFAM selectively suppresses the most informative attribute-specific features, compelling the PAR model to capture discriminative features from regions that are easily overlooked. The proposed loss enforces an orthogonal constraint between the original features extracted by ViT and the suppressed features from SFAM, promoting their complementarity in the feature space. We conduct experiments on several benchmark PAR datasets, including PETA, PA100K, RAPv1, and RAPv2, demonstrating the effectiveness of our method. In particular, our method outperforms state-of-the-art approaches such as GRL, IAA-Caps, ALM, and SSC in terms of mA on these four datasets.
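To make the two components concrete, the following is a minimal PyTorch sketch, assuming a simple channel-magnitude criterion for the suppression step and a cosine-similarity penalty for the orthogonal constraint. The function names, the suppression ratio, and the masking scheme are illustrative assumptions, not the paper's actual design; in the full model each feature branch would typically feed its own classification head, and only the loss computation is shown here.

import torch
import torch.nn.functional as F

def suppress_top_features(feats, ratio=0.3):
    # Zero out the most strongly activated channels of each feature
    # vector. feats: (batch, dim). `ratio` is a hypothetical fraction
    # of channels to suppress, standing in for SFAM's selection rule.
    k = max(1, int(feats.size(1) * ratio))
    _, top_idx = feats.abs().topk(k, dim=1)  # per-sample dominant channels
    mask = torch.ones_like(feats)
    mask.scatter_(1, top_idx, 0.0)           # drop the dominant channels
    return feats * mask

def orthogonal_feature_loss(original, suppressed):
    # Penalize directional overlap between the original ViT features
    # and the suppressed features: pushing the cosine similarity
    # toward zero encourages the two to encode complementary cues.
    cos = F.cosine_similarity(original, suppressed, dim=1)
    return cos.pow(2).mean()

# Toy usage: a batch of 4 pedestrian features of dimension 768
# (the ViT-Base embedding size).
feats = torch.randn(4, 768)
suppressed = suppress_top_features(feats)
loss = orthogonal_feature_loss(feats, suppressed)

Driving the cosine similarity toward zero is one straightforward way to realize the complementarity the abstract describes, since it forces the suppressed branch away from the direction of the original features.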
