Abstract

Object detection is an important computer vision technique that has increasingly attracted the attention of researchers in recent years. The literature to date has introduced a range of object detection models. However, these models have largely been English-language-based, and only a limited number of published studies have addressed how object detection can be implemented for the Arabic language. As far as we are aware, the use of an Arabic text-to-speech (TTS) engine to utter objects’ names and their positions in images, so as to help Arabic-speaking visually impaired people, has not been investigated previously. Therefore, in this study, we propose an object detection and segmentation model based on the Mask R-CNN algorithm that is capable of identifying and locating different objects in images, and then uttering their names and positions in Arabic. The proposed model was trained on the Pascal VOC 2007 and 2012 datasets and evaluated on the Pascal VOC 2007 test set. We believe that this is one of the few studies to use these datasets to train and test the Mask R-CNN model. The performance of the proposed object detection model was evaluated and compared with previous object detection models in the literature, and the results demonstrated its superiority, achieving an accuracy of 83.9%. Moreover, experiments were conducted to evaluate the performance of the incorporated translator and TTS engines, and the results showed that the proposed model can be effective in helping Arabic-speaking visually impaired people understand the content of digital images.
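The paper itself does not include code; the following is only a minimal sketch of the kind of pipeline the abstract describes, assuming an off-the-shelf torchvision Mask R-CNN (in place of the authors' model trained on Pascal VOC), a small hypothetical English-to-Arabic label dictionary standing in for the translator engine, and gTTS as a placeholder Arabic TTS engine. All function names, labels, and thresholds below are illustrative, not taken from the paper.

```python
# Sketch: detect objects, describe each one's coarse position in Arabic,
# and synthesize the sentence with a TTS engine.
import torch
from torchvision.models.detection import maskrcnn_resnet50_fpn
from torchvision.transforms.functional import to_tensor
from PIL import Image
from gtts import gTTS

# COCO class ids used by the pre-trained weights (truncated for brevity).
COCO_LABELS = {1: "person", 3: "car", 17: "cat", 18: "dog"}
# Hypothetical English-to-Arabic mapping; the paper uses a translator engine.
ARABIC_LABELS = {"person": "شخص", "car": "سيارة", "cat": "قطة", "dog": "كلب"}

def describe_image(path, score_threshold=0.7):
    # Pre-trained Mask R-CNN; the authors instead train on Pascal VOC 2007/2012.
    model = maskrcnn_resnet50_fpn(weights="DEFAULT").eval()
    image = Image.open(path).convert("RGB")
    with torch.no_grad():
        output = model([to_tensor(image)])[0]  # also returns "masks" for segmentation

    width, _ = image.size
    phrases = []
    for label_id, box, score in zip(output["labels"], output["boxes"], output["scores"]):
        if score < score_threshold:
            continue
        name = COCO_LABELS.get(int(label_id))
        if name is None or name not in ARABIC_LABELS:
            continue
        # Coarse horizontal position from the box centre: left / centre / right.
        cx = (box[0] + box[2]).item() / 2
        position = "يسار" if cx < width / 3 else "يمين" if cx > 2 * width / 3 else "وسط"
        phrases.append(f"{ARABIC_LABELS[name]} في {position} الصورة")

    sentence = " و ".join(phrases) if phrases else "لا توجد كائنات"
    gTTS(text=sentence, lang="ar").save("description.mp3")  # spoken Arabic output
    return sentence
```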
