Image-to-video person re-identification with cross-modal embeddings

Zhongwei Xie,Lin Li,Xian Zhong,Luo Zhong,Jianwen Xiang

doi:10.1016/j.patrec.2019.03.003

Abstract

Despite the great progress achieved, image-to-video person re-identification is still challenging in the cross-modal scenario. Currently, state-of-the-art approaches mainly concentrate on the task-specific data, neglecting the extra information from the different but related tasks. In this paper, we propose an end-to-end neural network framework for image-to-video person re-identification with cross-modal embeddings learned from extra information. Concretely speaking, cross-modal embedding layers from image captioning and video captioning models, are incorporated to learn common latent embeddings for multiple modalities. The learned multimodal embeddings are expected to focus on person’s prominent distinctions, due to textual descriptive information generally paying close attention to person’s explicit characteristics. Apart from that, our proposed framework resorts to CNNs and LSTMs for extracting visual and spatiotemporal features, and combines the strengths of identification and verification model to improve the discriminative ability of the learned features. The experimental results demonstrate the effectiveness of our framework on narrowing down the gap between heterogeneous data and obtaining observable improvement in the image-to-video person re-identification task.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Image-to-video person re-identification with cross-modal embeddings

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Mar 7, 2019
Citations: 8

Similar Papers

Person re-identification based on frequency channel attention networks under the surveillance scenario
Shengbo Chen ... Hongchang Zhang
Journal of Physics: Conference Series | VOL. 1966
Shengbo Chen, et. al.Shengbo Chen ... Hongchang Zhang
01 Jul 2021
Journal of Physics: Conference Series | VOL. 1966

Cross-Media Body-Part Attention Network for Image-to-Video Person Re-Identification
Benzhi Yu ... Ning Xu
IEEE Access | VOL. 7
Benzhi Yu, et. al.Benzhi Yu ... Ning Xu
01 Jan 2019
IEEE Access | VOL. 7

Recent progress in person re-ID
Zhang Yongfei ... Wang Shengjin
Journal of Image and Graphics | VOL. 28
Zhang Yongfei, et. al.Zhang Yongfei ... Wang Shengjin
01 Jan 2023
Journal of Image and Graphics | VOL. 28

Distance Metric Learning Using Privileged Information for Face Verification and Person Re-Identification.
Xinxing Xu ... Wen Li
IEEE Transactions on Neural Networks and Learning Systems | VOL. 26
Xinxing Xu, et. al.Xinxing Xu ... Wen Li
12 Mar 2015
IEEE Transactions on Neural Networks and Learning Systems | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Image-to-video person re-identification with cross-modal embeddings

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters