Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval

Fan Yang,Jing Xiao,Shin'Ichi Satoh,Zheng Wang

doi:10.1609/aaai.v34i07.6949

Abstract

Most recent approaches for the zero-shot cross-modal image retrieval map images from different modalities into a uniform feature space to exploit their relevance by using a pre-trained model. Based on the observation that manifolds of zero-shot images are usually deformed and incomplete, we argue that the manifolds of unseen classes are inevitably distorted during the training of a two-stream model that simply maps images from different modalities into a uniform space. This issue directly leads to poor cross-modal retrieval performance. We propose a bi-directional random walk scheme to mining more reliable relationships between images by traversing heterogeneous manifolds in the feature space of each modality. Our proposed method benefits from intra-modal distributions to alleviate the interference caused by noisy similarities in the cross-modal feature space. As a result, we achieved great improvement in the performance of the thermal v.s. visible image retrieval task. The code of this paper: https://github.com/fyang93/cross-modal-retrieval

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 23

Similar Papers

Deep-Learning-based Cross-Modal Luxury Microblogs Retrieval
Menghao Ma ... Wenhe Feng
-
Menghao Ma, et. al.Menghao Ma ... Wenhe Feng
11 Dec 2021
11 Dec 2021

Deep Adversarial Cascaded Hashing for Cross-Modal Vessel Image Retrieval
Jiaen Guo ... Xin Guan
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 16
Jiaen Guo, et. al.Jiaen Guo ... Xin Guan
01 Jan 2023
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 16

Bridging Modalities: A Survey of Cross-Modal Image-Text Retrieval
Tieying Li ... Jiaxing Xu
Chinese Journal of Information Fusion | VOL. 1
Tieying Li, et. al.Tieying Li ... Jiaxing Xu
12 Jun 2024
Chinese Journal of Information Fusion | VOL. 1

A Deep Semantic Alignment Network for the Cross-Modal Image-Text Retrieval in Remote Sensing
Qimin Cheng ... Peng Fu
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14
Qimin Cheng, et. al.Qimin Cheng ... Peng Fu
01 Jan 2020
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence