Abstract

Automatically describing the content of an image is an interesting and challenging task in artificial intelligence. In this paper, an enhanced image captioning model, comprising object detection, color analysis, and caption generation, is proposed to automatically generate textual descriptions of images. In the encoder–decoder captioning model, VGG16 serves as the encoder and a long short-term memory (LSTM) network with attention serves as the decoder. In addition, Mask R-CNN with OpenCV is used for object detection and color analysis. The generated caption and the recognized colors are then integrated to provide better descriptive details of images, and the resulting sentence is converted into speech. The validation results show that the proposed method provides more accurate descriptions of images.
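
As a concrete illustration of this encoder–decoder pipeline, the following is a minimal sketch in TensorFlow/Keras; the vocabulary size, layer widths, and the Bahdanau-style (additive) attention are assumptions for illustration, not the paper's exact configuration.

```python
import tensorflow as tf

VOCAB_SIZE = 5000  # assumed vocabulary size
EMBED_DIM = 256    # assumed word-embedding width
UNITS = 512        # assumed LSTM state width

# Encoder: VGG16 without its classifier head; the 7x7x512 feature map is
# reshaped into 49 spatial locations that the decoder can attend over.
vgg = tf.keras.applications.VGG16(include_top=False, weights="imagenet",
                                  input_shape=(224, 224, 3))
encoder = tf.keras.Model(vgg.input,
                         tf.keras.layers.Reshape((49, 512))(vgg.output))

class BahdanauAttention(tf.keras.layers.Layer):
    """Additive attention over the 49 image regions."""
    def __init__(self, units):
        super().__init__()
        self.w_feat = tf.keras.layers.Dense(units)
        self.w_hidden = tf.keras.layers.Dense(units)
        self.score = tf.keras.layers.Dense(1)

    def call(self, features, hidden):
        hidden = tf.expand_dims(hidden, 1)                    # (batch, 1, units)
        scores = self.score(tf.nn.tanh(
            self.w_feat(features) + self.w_hidden(hidden)))   # (batch, 49, 1)
        weights = tf.nn.softmax(scores, axis=1)
        context = tf.reduce_sum(weights * features, axis=1)   # (batch, 512)
        return context, weights

class Decoder(tf.keras.Model):
    """One LSTM step per word, conditioned on an attended image context."""
    def __init__(self, vocab_size, embed_dim, units):
        super().__init__()
        self.embed = tf.keras.layers.Embedding(vocab_size, embed_dim)
        self.attention = BahdanauAttention(units)
        self.lstm = tf.keras.layers.LSTM(units, return_state=True)
        self.fc = tf.keras.layers.Dense(vocab_size)

    def call(self, word_ids, features, state):
        h, c = state
        context, _ = self.attention(features, h)              # attend to regions
        x = self.embed(word_ids)                              # (batch, 1, embed)
        x = tf.concat([tf.expand_dims(context, 1), x], axis=-1)
        out, h, c = self.lstm(x, initial_state=[h, c])
        return self.fc(out), (h, c)

# One decoding step on a dummy image; [[1]] is an assumed <start> token id.
decoder = Decoder(VOCAB_SIZE, EMBED_DIM, UNITS)
features = encoder(tf.random.uniform((1, 224, 224, 3)))
state = (tf.zeros((1, UNITS)), tf.zeros((1, UNITS)))
logits, state = decoder(tf.constant([[1]]), features, state)
```

A full system would train this decoder with teacher forcing on caption data and loop the decoding step at inference until an end token is produced.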

Highlights

  • Image captioning essentially comprises two tasks: computer vision and natural language processing (NLP)

  • Computer vision helps to recognize and understand the scene presented in an image, and NLP converts this semantic knowledge into a descriptive sentence (a detection-and-color sketch follows this list)

  • Image captioning can be used in social media to automatically generate the caption for a posted image or to describe a video in real time
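
For the recognition step, the abstract pairs Mask R-CNN with OpenCV for object detection and color analysis. The sketch below is one plausible reading, assuming the TensorFlow Mask R-CNN Inception-v2 COCO model that OpenCV's DNN module can load; the file paths, the 0.5 confidence threshold, and the k-means-over-bounding-box color scheme are assumptions, not the paper's exact procedure.

```python
import cv2
import numpy as np

# Placeholder paths: a frozen TensorFlow Mask R-CNN graph and its OpenCV config.
net = cv2.dnn.readNetFromTensorflow("frozen_inference_graph.pb",
                                    "mask_rcnn_inception_v2_coco.pbtxt")

image = cv2.imread("input.jpg")
h, w = image.shape[:2]
net.setInput(cv2.dnn.blobFromImage(image, swapRB=True))
# boxes holds class ids, scores, and normalized coordinates; masks holds the
# per-instance segmentation masks (unused in this color sketch).
boxes, masks = net.forward(["detection_out_final", "detection_masks"])

def dominant_color(region, k=3):
    """Cluster the region's BGR pixels with k-means; return the largest centre."""
    pixels = np.float32(region.reshape(-1, 3))
    criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 10, 1.0)
    _, labels, centers = cv2.kmeans(pixels, k, None, criteria, 3,
                                    cv2.KMEANS_RANDOM_CENTERS)
    return centers[np.argmax(np.bincount(labels.flatten()))]

for i in range(boxes.shape[2]):
    class_id, score = int(boxes[0, 0, i, 1]), boxes[0, 0, i, 2]
    if score < 0.5:  # assumed confidence threshold
        continue
    x1, y1, x2, y2 = np.clip(boxes[0, 0, i, 3:7] * [w, h, w, h],
                             0, [w, h, w, h]).astype(int)
    region = image[y1:y2, x1:x2]
    if region.size:
        print(class_id, score, dominant_color(region))  # BGR of dominant cluster
```

The dominant BGR value can then be mapped to a color name and spliced into the generated caption, which is the integration step the abstract describes.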

Summary

Introduction

Image captioning essentially comprises two tasks: computer vision and natural language processing (NLP). Image captioning has many applications. For instance, it can aid visually challenged people in travelling independently, by first converting the scene into text and then converting the text into voice messages. Automatic captioning could also improve Google image search by converting an image into a caption and using its keywords for further related searches. It can likewise be used in surveillance, generating relevant captions from CCTV cameras and raising alarms if any suspicious activity is detected [1].
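
The text-to-voice step mentioned above reduces to a single library call in practice. A minimal sketch follows, assuming the offline pyttsx3 engine, since the paper does not name a specific TTS library; the caption string is an invented example.

```python
import pyttsx3  # assumed offline text-to-speech engine

caption = "a brown dog is running on the green grass"  # example generated caption

engine = pyttsx3.init()
engine.setProperty("rate", 150)  # speaking rate (words per minute)
engine.say(caption)
engine.runAndWait()              # block until the sentence has been spoken
```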

Related Works
Methods
Implementation
Preliminary Identification
Image Captioning and Object Recognition
Conclusions and Future Work
