Abstract

One of the most difficult tasks for visually impaired people is reading. Blind persons may now recognise boundaries, read labels or certain currencies, and even read written, typed, or printed information, thanks to a technology developed by a group of academics. In this work, an assistance system is designed for blind persons who are visually challenged by using Raspberry Pi as a 32-bit micro controller that utilizes the character recognition technology for image to audio conversion; this work proposes a more effective solution. A camera is the heart of the system, serving as the primary vision for identifying product reports, bills, bank account information, menu boards, school handouts, product packaging, and medication bottle directions. The proposed technique consists of four major steps: i) object identification, ii) text localization, iii) text extraction, iv) text to speech conversion. An optical character recognition (OCR) after recognizing printed text with a technique, the image is digitally processed to extract the label with the use of an open CV library. After the Region of Interest is extracted from the congested backdrop, the text localization technique is utilized to find and extract the text. The ROI's text is retrieved and converted into speech. For training the words independently, a convolutional recurrent neural network technique is proposed.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call