Abstract

Image captioning is the task of generating textual descriptions of the content of an image. It finds extensive utility in diverse applications, including the analysis of large, unlabelled image datasets, uncovering concealed patterns to support machine learning applications, guiding self-driving vehicles, and developing software to aid visually impaired individuals. Image captioning relies heavily on deep learning models, which have greatly simplified the task of generating captions for images. This paper focuses on the use of an encoder-decoder model with an attention mechanism for image captioning. In a classic image captioning model, the generated words usually describe only part of the image; with an attention mechanism, however, special attention is given to both the low-level and high-level features of the image. Object detection with an attention mechanism has been shown to increase the CIDEr score by 15%. Using the stable MSCOCO dataset available through Keras datasets, it is possible to achieve higher scores on caption generation and a more accurate description of the image.
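As an illustration of the attention step described above, the following is a minimal NumPy sketch of additive (Bahdanau-style) attention over image-region features, where the decoder's hidden state is used to weight each region before the weighted context is fed to the next decoding step. All shapes, weight initialisations, and names here are illustrative assumptions, not the paper's actual implementation:

```python
import numpy as np

def softmax(x):
    """Numerically stable softmax over a 1-D score vector."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def additive_attention(features, hidden, W1, W2, v):
    """Additive attention over image-region features.

    features: (num_regions, feat_dim) -- one vector per image region
    hidden:   (hid_dim,)              -- current decoder hidden state
    Returns the context vector and the attention weights.
    """
    # Score each region: v^T tanh(W1 f_i + W2 h)
    scores = np.tanh(features @ W1 + hidden @ W2) @ v   # (num_regions,)
    weights = softmax(scores)                           # sum to 1 over regions
    context = weights @ features                        # (feat_dim,) weighted sum
    return context, weights

# Toy example with assumed dimensions
rng = np.random.default_rng(0)
num_regions, feat_dim, hid_dim, attn_dim = 64, 256, 512, 128
features = rng.standard_normal((num_regions, feat_dim))
hidden = rng.standard_normal(hid_dim)
W1 = rng.standard_normal((feat_dim, attn_dim)) * 0.1
W2 = rng.standard_normal((hid_dim, attn_dim)) * 0.1
v = rng.standard_normal(attn_dim) * 0.1

context, weights = additive_attention(features, hidden, W1, W2, v)
print(context.shape, float(weights.sum()))
```

At each decoding step the attention weights redistribute over the regions, so different words in the caption can attend to different parts of the image rather than a single global feature vector.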
