Soft Contrastive Cross-Modal Retrieval

Jiayu Song,Jian Zhang,Shichao Zhang,Lei Zhu,Chengyuan Zhang,Yuxuan Hu

doi:10.3390/app14051944

Abstract

Cross-modal retrieval plays a key role in the Natural Language Processing area, which aims to retrieve one modality to another efficiently. Despite the notable achievements of existing cross-modal retrieval methodologies, the complexity of the embedding space increases with more complex models, leading to less interpretable and potentially overfitting representations. Most existing methods realize outstanding results based on datasets without any error or noise, but that is extremely ideal and leads to trained models lacking robustness. To solve these problems, in this paper, we propose a novel approach, Soft Contrastive Cross-Modal Retrieval (SCCMR), which integrates the deep cross-modal model with soft contrastive learning and smooth label cross-entropy learning to boost common subspace embedding and improve the generalizability and robustness of the model. To confirm the performance and effectiveness of SCCMR, we conduct extensive experiments comparing 12 state-of-the-art methods on three multi-modal datasets by using image–text retrieval as a showcase. The experimental results show that our proposed method outperforms the baselines.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Soft Contrastive Cross-Modal Retrieval

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Feb 27, 2024
License type: CC BY 4.0

Similar Papers

A Deep Semantic Alignment Network for the Cross-Modal Image-Text Retrieval in Remote Sensing
Qimin Cheng ... Peng Fu
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14
Qimin Cheng, et. al.Qimin Cheng ... Peng Fu
01 Jan 2020
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 14

Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing
Georgii Mikriukov ... Mahdyar Ravanbakhsh
-
Georgii Mikriukov, et. al.Georgii Mikriukov ... Mahdyar Ravanbakhsh
23 May 2022
23 May 2022

Bridging Modalities: A Survey of Cross-Modal Image-Text Retrieval
Tieying Li ... Jiaxing Xu
Chinese Journal of Information Fusion | VOL. 1
Tieying Li, et. al.Tieying Li ... Jiaxing Xu
12 Jun 2024
Chinese Journal of Information Fusion | VOL. 1

Review of Recent Deep Learning Based Methods for Image-Text Retrieval
Jianan Chen ... Cong Bai
-
Jianan Chen, et. al.Jianan Chen ... Cong Bai
17 Feb 2020
17 Feb 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Soft Contrastive Cross-Modal Retrieval

Abstract

Talk to us

Similar Papers

More From: Applied Sciences