Abstract

In today's digital age, the internet is saturated with images that often convey messages and emotions more effectively than words alone. Individuals with visual impairments, who cannot perceive these images, face significant obstacles in this visually oriented online environment. With millions of visually impaired people around the globe, it is essential to close this accessibility gap and enable them to interact with online visual content. We propose a novel model for neural image caption generation with visual attention to address this pressing issue. Our model uses a combination of convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to convert the content of images into spoken descriptions, making them accessible to the visually impaired. The primary objective of our project is to generate captions that accurately and effectively describe the visual elements of an image. The proposed model operates in two phases. First, the attention-based CNN-RNN model converts the image's content into a textual description. A text-to-speech API then converts this textual description into audio, allowing visually impaired individuals to perceive visual information through sound. Through extensive experimentation and evaluation, we aim to achieve a high level of accuracy and descriptiveness in our image captioning system. We will evaluate the model's performance through comprehensive qualitative and quantitative assessments, comparing its generated captions to ground-truth captions annotated by humans. By enabling visually impaired individuals to access and comprehend online images, our research promotes digital inclusion and equality. It has the potential to improve the online experience for millions of visually impaired people, enabling them to interact with visual content and enriching their lives through meaningful image-based interactions.
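
The two-phase pipeline described above can be sketched in code. The sketch below assumes a PyTorch implementation with a ResNet-50 encoder, a soft-attention LSTM decoder, and gTTS as the text-to-speech API; these specific choices are illustrative assumptions, not the exact configuration of the proposed system.

```python
# Minimal sketch of the two-phase pipeline: (1) CNN + attention-RNN captioning,
# (2) text-to-speech. Framework, backbone, and TTS library are assumptions.
import torch
import torch.nn as nn
import torchvision.models as models


class EncoderCNN(nn.Module):
    """Phase 1a: extract a grid of spatial features from the image with a CNN."""
    def __init__(self):
        super().__init__()
        resnet = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
        # Drop pooling and classification layers to keep per-region features.
        self.backbone = nn.Sequential(*list(resnet.children())[:-2])

    def forward(self, images):                        # images: (B, 3, H, W)
        feats = self.backbone(images)                 # (B, 2048, h, w)
        return feats.flatten(2).permute(0, 2, 1)      # (B, h*w, 2048) regions


class AttentionDecoder(nn.Module):
    """Phase 1b: an RNN that attends over image regions while emitting words."""
    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512, feat_dim=2048):
        super().__init__()
        self.hidden_dim = hidden_dim
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.attn = nn.Linear(hidden_dim + feat_dim, 1)
        self.rnn = nn.LSTMCell(embed_dim + feat_dim, hidden_dim)
        self.fc = nn.Linear(hidden_dim, vocab_size)

    def forward(self, feats, captions):               # feats: (B, N, feat_dim)
        B, N, _ = feats.shape
        h = feats.new_zeros(B, self.hidden_dim)
        c = feats.new_zeros(B, self.hidden_dim)
        outputs = []
        for t in range(captions.size(1)):
            # Soft attention: score each region against the current hidden state.
            scores = self.attn(torch.cat(
                [h.unsqueeze(1).expand(-1, N, -1), feats], dim=2))
            alpha = torch.softmax(scores, dim=1)      # attention weights (B, N, 1)
            context = (alpha * feats).sum(dim=1)      # weighted image context
            word = self.embed(captions[:, t])
            h, c = self.rnn(torch.cat([word, context], dim=1), (h, c))
            outputs.append(self.fc(h))
        return torch.stack(outputs, dim=1)            # (B, T, vocab_size) logits


def caption_to_speech(caption_text, out_path="caption.mp3"):
    """Phase 2: convert the generated caption to audio with a TTS API
    (gTTS is used here only as an example of such a service)."""
    from gtts import gTTS
    gTTS(text=caption_text, lang="en").save(out_path)
```

At inference time the decoder would be run autoregressively, feeding back its own predicted words, and the resulting sentence passed to `caption_to_speech`; the teacher-forced forward pass shown above is one common training setup rather than the only option.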
