Robust Video-Based Person Re-Identification by Hierarchical Mining

Zhikang Wang,Xinbo Gao,Jiashi Feng,Lihuo He,Shengmei Shen,Xiaoguang Tu,Jian Zhao

doi:10.1109/tcsvt.2021.3076097

Abstract

Video-based person re-identification (Re-ID) aims at retrieving the person through the video sequences across non-overlapping cameras. Some characteristics of pedestrians are not consecutive across frames due to the variations of viewpoints, postures, and occlusions over time. However, existing methods ignore such data peculiarity and the networks tend to only learn those salient consecutive characteristics among frames in video sequences. As a result, the learned representations fail to cover all the characteristics of pedestrians, thus lacking integrity and discrimination. To tackle this problem, we present a novel deep architecture termed Hierarchical Mining Network (HMN), which mines as many pedestrians’ characteristics by referring to the temporal and intra-class knowledge. It consists of a novel Attentive Temporal Module (ATM) and a Dynamic Supervising Branch (DSB), with a Balancing Triplet Loss (BTL) assisting the training. The proposed ATM, with pedestrian perceiving capacity, is capable of evaluating each activation of features through temporal analysis, so that the temporally scattered characteristics of pedestrians can be better aggregated and the contaminated ones can be eliminated. Then, the DSB along with the BTL further enhances the integrity of representations by multiple supervision. Specifically, the DSB perceives the diversities of intra-class samples in each mini-batch and generates targeted supervising signals for them, in which process the BTL guarantees the signals with smaller intra-class variations and larger inter-class variations. Comprehensive experiments on two video-based datasets, i.e., MARS, and DukeMTMC-VideoReID, demonstrate the contribution of each component and the superiority of the proposed HMN over the state-of-the-arts. Benchmarking our model on three popular image-based datasets, i.e., Market1501, DukeMTMC-Reid, and MSMT17 additionally verifies the promising generalizability of the proposed DSB and BTL.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust Video-Based Person Re-Identification by Hierarchical Mining

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: Apr 29, 2021
Citations: 27

Similar Papers

Relation-based global-partial feature learning network for video-based person re-identification
Fan Yang ... Wei Li
Neurocomputing | VOL. 488
Fan Yang, et. al.Fan Yang ... Wei Li
12 Mar 2022
Neurocomputing | VOL. 488

Two-Level Attention Model Based Video Action Recognition Network
Haifeng Sang ... Dakuo He
IEEE Access | VOL. 7
Haifeng Sang, et. al.Haifeng Sang ... Dakuo He
01 Jan 2019
IEEE Access | VOL. 7

Scalar fields features of video sequences energy characteristics
S.V. Vasilyev ... I.V. Zhigulina
Radioengineering | VOL. 2
S.V. Vasilyev, et. al.S.V. Vasilyev ... I.V. Zhigulina
01 Feb 2024
Radioengineering | VOL. 2

Viewing from Frequency Domain
Liangchen Liu ... Nannan Wang
-
Liangchen Liu, et. al.Liangchen Liu ... Nannan Wang
17 Oct 2021
17 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust Video-Based Person Re-Identification by Hierarchical Mining

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology