Abstract

Most remote sensing image captioning (IC) models are based on encoder–decoder frameworks in which a convolutional neural network (CNN) encodes the image information and a recurrent neural network (RNN) decodes it into a sentence description. To achieve good accuracy, encoder–decoder frameworks relying on RNNs typically require a large number of annotated samples. Furthermore, they demand expensive computational resources to reach reasonable training and testing times. In this article, we aim to address these issues by introducing a novel decoder based on support vector machines (SVMs). In particular, instead of RNNs, we propose a novel network of SVMs to decode the image information into a sentence description. The proposed IC system is particularly interesting when only a limited number of training samples is available. Experiments conducted on four different IC datasets confirm the promising capability of the proposed IC system to generate descriptions that are highly correlated with the image content. The proposed IC system is also characterized by short training and inference times compared with other state-of-the-art models.
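
To illustrate the general idea of replacing the RNN decoder with SVMs, the sketch below shows one plausible reading of such a design: CNN image features are assumed to be precomputed, and a separate SVM predicts the word at each position of the caption from the image features plus the previously generated word. The vocabulary, step-wise layout, and kernel choice here are illustrative assumptions, not the exact architecture proposed in the paper.

```python
# Minimal sketch (not the authors' exact architecture): a chain of SVM
# classifiers decodes precomputed CNN image features into a caption,
# predicting one word per step from the image features concatenated
# with a one-hot encoding of the previous word.
import numpy as np
from sklearn.svm import SVC

# Toy vocabulary for illustration only.
VOCAB = ["<start>", "a", "plane", "parked", "on", "the", "runway", "<end>"]
W2I = {w: i for i, w in enumerate(VOCAB)}


def one_hot(idx, size=len(VOCAB)):
    v = np.zeros(size)
    v[idx] = 1.0
    return v


class SVMDecoder:
    """One SVM per decoding step, trained on (image feature, previous word) pairs."""

    def __init__(self, max_len=6):
        self.steps = [SVC(kernel="rbf") for _ in range(max_len)]

    def fit(self, features, captions):
        """features: list of CNN feature vectors; captions: lists of word
        indices already padded with <end> up to max_len."""
        for t, svm in enumerate(self.steps):
            X = [np.concatenate([f, one_hot(cap[t - 1] if t else W2I["<start>"])])
                 for f, cap in zip(features, captions)]
            y = [cap[t] for cap in captions]
            svm.fit(np.array(X), np.array(y))

    def decode(self, feature):
        """Greedily generate a caption for one image feature vector."""
        words, prev = [], W2I["<start>"]
        for svm in self.steps:
            x = np.concatenate([feature, one_hot(prev)]).reshape(1, -1)
            prev = int(svm.predict(x)[0])
            if VOCAB[prev] == "<end>":
                break
            words.append(VOCAB[prev])
        return " ".join(words)
```

Because each step is a standard kernel SVM rather than a recurrent cell trained by backpropagation through time, training on a small annotated set is comparatively fast, which is consistent with the low-data, low-compute motivation stated in the abstract.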
