Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect

Xinyang Jiang,Feng Zheng,Wei-Shi Zheng,Xing Sun,Qize Yang,Xiaowei Guo,Yifei Gong,Feiyue Huang

doi:10.1609/aaai.v34i07.6770

Abstract

Recently, the research interest of person re-identification (ReID) has gradually turned to video-based methods, which acquire a person representation by aggregating frame features of an entire video. However, existing video-based ReID methods do not consider the semantic difference brought by the outputs of different network stages, which potentially compromises the information richness of the person features. Furthermore, traditional methods ignore important relationship among frames, which causes information redundancy in fusion along the time axis. To address these issues, we propose a novel general temporal fusion framework to aggregate frame features on both semantic aspect and time aspect. As for the semantic aspect, a multi-stage fusion network is explored to fuse richer frame features at multiple semantic levels, which can effectively reduce the information loss caused by the traditional single-stage fusion. While, for the time axis, the existing intra-frame attention method is improved by adding a novel inter-frame attention module, which effectively reduces the information redundancy in temporal fusion by taking the relationship among frames into consideration. The experimental results show that our approach can effectively improve the video-based re-identification accuracy, achieving the state-of-the-art performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 13

Similar Papers

A review on video person re-identification based on deep learning
Haifei Ma ... Chunrong Wei
Neurocomputing | VOL. 609
Haifei Ma, et. al.Haifei Ma ... Chunrong Wei
28 Aug 2024
Neurocomputing | VOL. 609

Multi-Scale Feature Fusion Network for Video-Based Person Re-Identification
Penggao Liu ... Mingjing Ai
-
Penggao Liu, et. al.Penggao Liu ... Mingjing Ai
27 Aug 2021
27 Aug 2021

Deep video-based person re-identification (Deep Vid-ReID): comprehensive survey
Rana S M Saad ... Hesham Farouk
EURASIP Journal on Advances in Signal Processing | VOL. 2024
Rana S M Saad, et. al.Rana S M Saad ... Hesham Farouk
15 May 2024
EURASIP Journal on Advances in Signal Processing | VOL. 2024

Effective multi-shot person re-identification through representative frames selection and temporal feature pooling
Thuy-Binh Nguyen ... Thi-Lan Le
Multimedia Tools and Applications | VOL. 78
Thuy-Binh Nguyen, et. al.Thuy-Binh Nguyen ... Thi-Lan Le
12 Oct 2019
Multimedia Tools and Applications | VOL. 78

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rethinking Temporal Fusion for Video-Based Person Re-Identification on Semantic and Time Aspect

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence