Abstract

The performance of remote sensing image retrieval (RSIR) systems depends on the capability of the extracted features to characterize the semantic content of images. Existing RSIR systems describe images by visual descriptors that model the primitives (such as different land-cover classes) present in the images. However, visual descriptors may not be sufficient to describe the high-level complex content of RS images (e.g., the attributes of and relationships among different land-cover classes). To address this issue, in this article, we present an RSIR system that generates and exploits textual descriptions, in the form of captions (i.e., sentences), to accurately describe the objects present in RS images, their attributes, and the relationships between them. To this end, the proposed retrieval system consists of three main steps. The first step encodes the image visual features and translates the encoded features into a textual description that summarizes the content of the image as a caption. This is achieved by combining a convolutional neural network with a recurrent neural network. The second step converts the generated textual descriptions into semantically meaningful feature vectors by using recent word embedding techniques. Finally, the last step estimates the similarity between the textual-description vectors of the query image and those of the archive images, and then retrieves the images most similar to the query. Experimental results obtained on two different datasets show that describing image content with captions in the framework of RSIR leads to accurate retrieval performance.
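The three steps above can be sketched end to end. In the sketch below, `generate_caption` is a stub standing in for the trained CNN-RNN captioning model, and the word embeddings are made-up illustrative vectors rather than trained word2vec/GloVe embeddings; only the overall pipeline (caption → averaged embedding vector → cosine-similarity ranking) follows the description.

```python
import numpy as np

# Illustrative word embeddings; a real system would use trained
# word2vec/GloVe/fastText vectors rather than these made-up values.
EMBEDDINGS = {
    "dense":       np.array([0.9, 0.1, 0.0]),
    "residential": np.array([0.8, 0.2, 0.1]),
    "buildings":   np.array([0.7, 0.2, 0.1]),
    "river":       np.array([0.0, 0.9, 0.3]),
    "through":     np.array([0.1, 0.8, 0.2]),
    "farmland":    np.array([0.1, 0.3, 0.9]),
}

def generate_caption(image):
    # Step 1 stub: in the paper, a CNN encodes the image and an RNN decodes
    # the features into a sentence; a simple lookup stands in for that model.
    return image["caption"]

def encode_sentence(caption):
    # Step 2: average the word embeddings of the caption into one
    # semantically meaningful feature vector.
    vecs = [EMBEDDINGS[w] for w in caption.lower().split() if w in EMBEDDINGS]
    return np.mean(vecs, axis=0)

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def retrieve(query, archive, top_k=1):
    # Step 3: rank archive images by the similarity of their caption
    # vectors to the query's caption vector.
    q = encode_sentence(generate_caption(query))
    ranked = sorted(
        archive,
        key=lambda im: cosine(q, encode_sentence(generate_caption(im))),
        reverse=True,
    )
    return ranked[:top_k]

query = {"caption": "dense residential buildings"}
archive = [
    {"caption": "river through farmland"},
    {"caption": "residential buildings"},
]
print(retrieve(query, archive)[0]["caption"])  # most similar archive image
```

Averaging word vectors is only one simple sentence-encoding choice; any encoder that maps a caption to a fixed-length vector could replace `encode_sentence` without changing the rest of the pipeline.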

Highlights

  • Recent advances in satellite technology result in an explosive growth of remote sensing (RS) image archives

  • In the RS community, great attention is devoted to content-based image retrieval, which aims to search for and retrieve the images most similar to a query image

Manuscript received January 23, 2020; revised April 19, 2020, June 18, 2020, and July 22, 2020; accepted July 27, 2020

  • The proposed methodology consists of three main steps, which are as follows: 1) image caption generation; 2) sentence encoding; and 3) image retrieval based on the encoded sentences of images

Introduction

Recent advances in satellite technology have resulted in an explosive growth of remote sensing (RS) image archives. One of the important research topics is the development of accurate RS image retrieval (RSIR) systems to retrieve the images most relevant to a query image from such massive archives. Traditional content-based RSIR systems rely on hand-crafted features to describe the semantic content of images. To this end, several visual descriptors have been presented in RS. Unsupervised methods compute the similarity between the visual features of the query image and those of the archive images and retrieve the images most similar to the query. To this end, one can use the k-nearest neighbor algorithm. In [9], a sparse reconstruction-based method that generalizes the standard sparse classifier to the case of multilabel RS image retrieval problems is introduced
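The k-nearest-neighbor retrieval mentioned above can be sketched as follows; the 4-D visual descriptors here are made-up illustrative values, and Euclidean distance is one common similarity choice, not necessarily the one used in any particular system.

```python
import numpy as np

def knn_retrieve(query_feat, archive_feats, k=3):
    # Rank archive images by Euclidean distance to the query in feature
    # space and return the indices of the k nearest ones.
    dists = np.linalg.norm(archive_feats - query_feat, axis=1)
    return np.argsort(dists)[:k].tolist()

# Made-up 4-D visual descriptors for five archive images (illustrative only).
archive = np.array([
    [0.90, 0.10, 0.00, 0.20],
    [0.10, 0.80, 0.30, 0.00],
    [0.85, 0.15, 0.05, 0.25],
    [0.20, 0.20, 0.90, 0.10],
    [0.88, 0.12, 0.02, 0.22],
])
query = np.array([0.90, 0.10, 0.00, 0.20])
print(knn_retrieve(query, archive))  # indices of the 3 most similar images
```

The same ranking step applies unchanged whether the feature vectors come from hand-crafted visual descriptors or from the caption-based sentence encodings proposed in this article.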
