Abstract

Image captioning, also known as picture captioning, has become a widely used technology in applications that automatically generate captions for photographs. This is accomplished with deep neural networks, which identify the objects in an image along with their attributes and relationships. The purpose of this research is to detect the objects in a photograph, determine their relationships, and generate descriptive captions. The proposed system is implemented in Python on the Flickr8k dataset. The input images are pre-processed, and image features are then extracted using a convolutional neural network (CNN). A long short-term memory (LSTM) network is used to translate the features and objects extracted by the CNN into a natural English sentence. The system is tested on different types of images, and the results are presented together with the generated captions, demonstrating the accuracy of the system. The presented method has potential for applications where image captioning is essential.
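To illustrate the kind of CNN encoder plus LSTM decoder pipeline described above, the sketch below builds a minimal captioning model in Keras. It is only an illustrative sketch, not the authors' implementation: the choice of InceptionV3 as the feature extractor, the layer sizes, and the vocabulary size and maximum caption length constants are assumptions introduced here for demonstration.

```python
# Minimal sketch of a CNN + LSTM image-captioning model (illustrative only).
# VOCAB_SIZE, MAX_LENGTH, the 256-unit layers, and InceptionV3 as the encoder
# are assumed values for demonstration, not figures reported in the paper.

import numpy as np
from tensorflow.keras.applications import InceptionV3
from tensorflow.keras.applications.inception_v3 import preprocess_input
from tensorflow.keras.preprocessing import image
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Input, Dense, Embedding, LSTM, Dropout, add

VOCAB_SIZE = 8000    # assumed caption vocabulary size
MAX_LENGTH = 34      # assumed maximum caption length in tokens
FEATURE_DIM = 2048   # size of the pooled InceptionV3 feature vector

# CNN encoder: a pre-trained InceptionV3 with its classification head removed,
# so the output of the global-average-pooling layer serves as the image feature.
base_cnn = InceptionV3(weights="imagenet")
encoder = Model(base_cnn.input, base_cnn.layers[-2].output)

def extract_features(img_path):
    """Pre-process one image and return its 2048-dimensional CNN feature vector."""
    img = image.load_img(img_path, target_size=(299, 299))
    x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))
    return encoder.predict(x, verbose=0)  # shape: (1, 2048)

# Decoder: combine the image features with a partial caption and predict the next word.
img_input = Input(shape=(FEATURE_DIM,))
img_dense = Dense(256, activation="relu")(Dropout(0.5)(img_input))

seq_input = Input(shape=(MAX_LENGTH,))
seq_embed = Embedding(VOCAB_SIZE, 256, mask_zero=True)(seq_input)
seq_lstm = LSTM(256)(Dropout(0.5)(seq_embed))

merged = add([img_dense, seq_lstm])
output = Dense(VOCAB_SIZE, activation="softmax")(Dense(256, activation="relu")(merged))

caption_model = Model([img_input, seq_input], output)
caption_model.compile(loss="categorical_crossentropy", optimizer="adam")
caption_model.summary()
```

At inference time, a model of this form would typically be run word by word: starting from a start token, the predicted word is appended to the partial caption and fed back in until an end token or the maximum length is reached.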
