Abstract

This paper presents an efficient approach to image captioning that combines a Convolutional Neural Network (CNN) with a Recurrent Neural Network (RNN) built from Long Short-Term Memory (LSTM) cells. Image captioning is a problem at the intersection of computer vision and deep learning that deals with generating relevant natural-language captions for a given input image. The work focuses on hyperparameter tuning of the CNN and the RNN so that the generated captions are as accurate as possible. The basic pipeline feeds an image to the CNN, which outputs a feature map; this feature map is then passed to the RNN, which outputs a sentence describing the image. Research on image captioning is relevant because it demonstrates the power of an encoder-decoder network composed of a CNN and an RNN, and it opens pathways for further research on other types of neural networks.
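A minimal sketch of the encoder-decoder pipeline described above, written in PyTorch. The choice of a ResNet-18 backbone, the layer sizes, and the way the image features are prepended to the caption sequence are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn
from torchvision import models

class Encoder(nn.Module):
    """CNN encoder: maps an image to a fixed-length feature vector."""
    def __init__(self, embed_size):
        super().__init__()
        resnet = models.resnet18(weights=None)  # any CNN backbone works here
        # Drop the classification head; keep the convolutional feature extractor.
        self.backbone = nn.Sequential(*list(resnet.children())[:-1])
        self.fc = nn.Linear(resnet.fc.in_features, embed_size)

    def forward(self, images):
        features = self.backbone(images).flatten(1)  # (batch, 512)
        return self.fc(features)                     # (batch, embed_size)

class Decoder(nn.Module):
    """LSTM decoder: generates a caption conditioned on the image features."""
    def __init__(self, embed_size, hidden_size, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # Prepend the image features as the first step of the input sequence,
        # then let the LSTM predict the next word at every position.
        embeddings = self.embed(captions)                         # (batch, T, embed)
        inputs = torch.cat([features.unsqueeze(1), embeddings], dim=1)
        hidden, _ = self.lstm(inputs)
        return self.fc(hidden)  # per-step vocabulary logits

# Usage sketch: encode a batch of images, then score a batch of captions.
encoder = Encoder(embed_size=256)
decoder = Decoder(embed_size=256, hidden_size=512, vocab_size=10000)
images = torch.randn(4, 3, 224, 224)      # dummy image batch
captions = torch.randint(0, 10000, (4, 15))  # dummy token ids
logits = decoder(encoder(images), captions)
```

At inference time the same decoder would be run autoregressively, feeding each predicted word back in as the next input until an end-of-sentence token is produced.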
