Generation of Image Caption Using CNN-LSTM Based Approach

S Aravindkumar, P Varalakshmi, M Hemalatha

doi:10.1007/978-3-030-16657-1_43

Abstract

Image captioning is gaining attention due to recent developments in deep neural architectures. However, the gap between semantic concepts and visual features remains a major challenge in image caption generation. In this paper, we develop a method that uses both visual and semantic features for caption generation. We briefly discuss the various architectures used for visual feature extraction and the Long Short-Term Memory (LSTM) network used for caption generation. An object recognition model is developed to identify semantic tags in the images. These tags are encoded along with the visual features for the captioning task. We develop an Encoder-Decoder architecture that uses the semantic details along with the language model for caption generation. We evaluate our model on standard datasets such as Flickr8k, Flickr30k and MSCOCO using standard metrics such as BLEU and METEOR.
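The abstract names BLEU as one of the evaluation metrics. As a rough illustration of what that metric measures, the following is a minimal sketch of sentence-level BLEU (clipped n-gram precision combined with a brevity penalty), written with the Python standard library only. It is not the authors' evaluation code: the function names, the whitespace tokenization and the absence of smoothing are assumptions made for this example, and published results are normally computed at corpus level with an established toolkit.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams (as tuples) that occur in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def modified_precision(candidate, references, n):
    """Clipped n-gram precision: a candidate n-gram is credited at most as
    many times as it occurs in the reference that contains it most often."""
    cand_counts = ngrams(candidate, n)
    if not cand_counts:
        return 0.0
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in ngrams(ref, n).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)
    clipped = sum(min(count, max_ref_counts[gram])
                  for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def brevity_penalty(candidate, references):
    """Penalize candidates that are shorter than the closest reference."""
    c = len(candidate)
    if c == 0:
        return 0.0
    # Closest reference length; ties are resolved toward the shorter one.
    r = min((len(ref) for ref in references),
            key=lambda ref_len: (abs(ref_len - c), ref_len))
    return 1.0 if c > r else math.exp(1.0 - r / c)


def sentence_bleu(candidate, references, max_n=4):
    """Unsmoothed BLEU: geometric mean of the 1..max_n clipped precisions,
    multiplied by the brevity penalty. Returns 0.0 if any precision is 0."""
    precisions = [modified_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0
    log_mean = sum(math.log(p) for p in precisions) / max_n
    return brevity_penalty(candidate, references) * math.exp(log_mean)


# Hypothetical captions, tokenized by whitespace (an assumption of this sketch).
references = ["a dog runs on the grass".split(),
              "a dog is running across a lawn".split()]
candidate = "a dog runs on the lawn".split()
score = sentence_bleu(candidate, references, max_n=2)
```

Because the sketch is unsmoothed, a single missing n-gram order drives the score to zero, which is one reason corpus-level aggregation or smoothing is usually preferred for short captions.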

Similar Papers

Comparative Evaluation of CNN Architectures for Image Caption Generation
Sulabh Katiyar ... Samir Kumar
International Journal of Advanced Computer Science and Applications | VOL. 11
01 Jan 2020

Image Caption Generator by using CNN and LSTM
S Pasupathy
International Journal For Multidisciplinary Research | VOL. 5
23 Apr 2023

Synthesis of Vision and Language: Multifaceted Image Captioning Application
Arpit Gupta ... Himanshu Goyal
INTERNATIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT | VOL. 07
23 Dec 2023

Chinese Image Caption Generation via Visual Attention and Topic Modeling.
Maofu Liu ... Lingjun Li
IEEE Transactions on Cybernetics | VOL. 52
22 Jun 2020
