Abstract

Cross-modal text-image retrieval in remote sensing (RS) provides a flexible way to mine useful information from RS repositories. However, existing methods accept queries formulated only in English, which can restrict access to useful information for non-English speakers. Allowing multilanguage queries can improve communication with the retrieval system and broaden access to RS information. To address this limitation, this article proposes a multilanguage framework based on transformers. Specifically, the framework is composed of two transformer encoders for learning modality-specific representations: a language encoder that generates language representation features from the textual description, and a vision encoder that extracts visual features from the corresponding image. The two encoders are trained jointly on image-text pairs by minimizing a bidirectional contrastive loss. To enable the model to understand queries in multiple languages, we train it on descriptions in four different languages, namely, English, Arabic, French, and Italian. Experimental results on three benchmark datasets (i.e., RSITMD, RSICD, and UCM) demonstrate that the proposed model significantly improves retrieval performance in terms of recall compared with existing state-of-the-art RS retrieval methods.
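To illustrate the training objective the abstract describes, the sketch below shows a common form of bidirectional contrastive loss for a dual-encoder setup: a symmetric, CLIP-style InfoNCE over a batch of matched image-text embedding pairs. The exact formulation, function name, and temperature value are assumptions for illustration, not the paper's confirmed implementation.

```python
import torch
import torch.nn.functional as F

def bidirectional_contrastive_loss(img_emb, txt_emb, temperature=0.07):
    """Symmetric contrastive loss over a batch of image-text pairs.

    img_emb, txt_emb: (batch, dim) embeddings from the vision and
    language encoders; matching pairs share the same row index.
    Note: this is an illustrative sketch, not the paper's exact loss.
    """
    # L2-normalize so the dot product is a cosine similarity.
    img_emb = F.normalize(img_emb, dim=-1)
    txt_emb = F.normalize(txt_emb, dim=-1)

    # Pairwise similarity matrix, scaled by the temperature.
    logits = img_emb @ txt_emb.t() / temperature

    # The positive pair for row i sits on the diagonal (column i).
    targets = torch.arange(img_emb.size(0), device=img_emb.device)

    # Average the image-to-text and text-to-image cross-entropy terms,
    # which makes the objective bidirectional.
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return (loss_i2t + loss_t2i) / 2
```

In this setup the multilanguage capability comes from the data rather than the loss: the same objective is applied to image-caption pairs drawn from all four languages, so captions in any of them are pulled toward their matching image in the shared embedding space.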
