Abstract

Video-based person re-identification (Re-ID) aims to retrieve a target person from video sequences captured by a distributed camera system. It remains a challenging task due to factors such as occlusion and misalignment in the video. To address these problems, many methods have been proposed to exploit multi-scale spatio-temporal features in videos. However, established methods typically assign equal weights to temporal or spatial features at different scales, which diminishes the distinct role of each feature. In this paper, we propose a novel Multi-scale Feature Aggregation Network (MFANet) for video-based person Re-ID. Specifically, we propose two flexible modules: Multi-scale Temporal Feature Aggregation (MTFA) and Multi-scale Spatial Feature Aggregation (MSFA). These modules first extract temporal (dynamic and static) and spatial (coarse and fine) features at different scales, and then adaptively weight each feature according to the input video sequence. Both lightweight modules can be incorporated into a 3D Convolutional Neural Network to build our MFANet. Extensive experiments on four public benchmarks demonstrate that MTFA and MSFA improve the performance of baseline architectures, and that MFANet outperforms state-of-the-art methods.
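To make the adaptive weighting idea concrete, below is a minimal PyTorch sketch of input-dependent aggregation over multi-scale features. It is an illustration of the general mechanism only: the module name AdaptiveScaleAggregation, the pooling-plus-linear weight predictor, and all tensor shapes are assumptions for this sketch, not the published design of MTFA or MSFA.

    # Minimal sketch of adaptive multi-scale feature aggregation (PyTorch).
    # All names, branch designs, and shapes here are illustrative assumptions,
    # not the paper's actual MTFA/MSFA implementation.
    import torch
    import torch.nn as nn

    class AdaptiveScaleAggregation(nn.Module):
        """Fuses K same-shaped feature maps with weights predicted per input."""
        def __init__(self, channels: int, num_scales: int):
            super().__init__()
            # Predict one weight per scale from globally pooled statistics.
            self.fc = nn.Linear(channels * num_scales, num_scales)

        def forward(self, feats):
            # feats: list of K tensors, each of shape (N, C, T, H, W)
            pooled = [f.mean(dim=(2, 3, 4)) for f in feats]            # K x (N, C)
            weights = torch.softmax(self.fc(torch.cat(pooled, 1)), 1)  # (N, K)
            stacked = torch.stack(feats, dim=1)                        # (N, K, C, T, H, W)
            w = weights.view(*weights.shape, 1, 1, 1, 1)               # broadcastable
            return (w * stacked).sum(dim=1)                            # (N, C, T, H, W)

    # Example: fuse hypothetical "dynamic" and "static" temporal branch outputs.
    agg = AdaptiveScaleAggregation(channels=256, num_scales=2)
    dynamic = torch.randn(4, 256, 8, 16, 8)
    static = torch.randn(4, 256, 8, 16, 8)
    fused = agg([dynamic, static])  # (4, 256, 8, 16, 8)

The softmax over scale weights makes the fusion input-dependent, so each video sequence can emphasize, say, its dynamic branch over its static one, rather than receiving a fixed equal-weight average.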
