Improvement of the end-to-end scene text recognition method for “text-to-speech” conversion

Fazliddin Makhmudov, Utkir Khamdamov, Kuldoshbay Avazov, Akmalbek Abdusalomov, Young Im Cho, Mukhriddin Mukhiddinov

doi:10.1142/s0219691320500526

Abstract

Text detection and recognition in natural scene images has become an active research topic in computer vision, with encouraging results on several benchmarks. In this paper, we introduce a simple yet robust pipeline for fast and accurate detection and recognition of Uzbek-language text in natural scene images, built on a fully convolutional network and the Tesseract OCR engine. First, the text detection step uses a single fully convolutional neural network to quickly predict text at arbitrary orientations in full-color images, discarding redundant intermediate stages. Then, the text recognition step reads Uzbek text in both the Latin and Cyrillic alphabets using a trained Tesseract OCR engine. Finally, the recognized text can be spoken by an Uzbek-language text-to-speech synthesizer. The proposed method was tested on the ICDAR 2013, ICDAR 2015 and MSRA-TD500 datasets, where it detected and recognized text in natural scene images efficiently, which makes it suitable for assisting the visually impaired.
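
The three stages described in the abstract (detect, recognize, speak) can be pictured as a small pipeline. The following is only an illustrative sketch, not the authors' implementation: the class name `SceneTextToSpeech`, the stage signatures, the top-to-bottom reading order, and the `fake_*` stand-ins are all assumptions introduced here. In a real system the stand-ins would be replaced by a fully convolutional detector, a Tesseract model trained for Uzbek, and an Uzbek speech synthesizer.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# A text region as four (x, y) corners, so rotated text can be represented.
Point = Tuple[int, int]
Quad = Tuple[Point, Point, Point, Point]


@dataclass
class SceneTextToSpeech:
    """Hypothetical wiring of the three stages named in the abstract."""
    detect: Callable[[object], Sequence[Quad]]   # image -> text regions
    recognize: Callable[[object, Quad], str]     # image + region -> text
    speak: Callable[[str], bytes]                # text -> audio bytes

    def run(self, image: object) -> Tuple[List[str], bytes]:
        regions = self.detect(image)
        # Assumed reading order: top-to-bottom, then left-to-right.
        ordered = sorted(
            regions,
            key=lambda q: (min(p[1] for p in q), min(p[0] for p in q)),
        )
        texts = [self.recognize(image, q).strip() for q in ordered]
        texts = [t for t in texts if t]          # drop empty recognitions
        return texts, self.speak(" ".join(texts))


# Placeholder stages so the sketch runs without any model or OCR engine.
def fake_detect(image: object) -> Sequence[Quad]:
    return [
        ((0, 40), (50, 40), (50, 60), (0, 60)),  # lower line of text
        ((0, 0), (50, 0), (50, 20), (0, 20)),    # upper line of text
    ]


def fake_recognize(image: object, quad: Quad) -> str:
    # Look up a canned string by the region's top edge (demo only).
    return {0: "Salom", 40: "dunyo"}[quad[0][1]]


def fake_speak(text: str) -> bytes:
    # Stand-in for a synthesizer: returns the text as bytes, not real audio.
    return text.encode("utf-8")


pipeline = SceneTextToSpeech(fake_detect, fake_recognize, fake_speak)
texts, audio = pipeline.run(image=None)
```

Keeping each stage behind a plain callable means the detector, the OCR engine, and the synthesizer can be swapped or tested independently, which matches the abstract's description of separate detection, recognition, and speech steps.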

Journal: International Journal of Wavelets, Multiresolution and Information Processing
Publication Date: Sep 15, 2020
Citations: 21
License type: CC BY

Similar Papers

Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images.
Asghar Ali Chandio ... Mehwish Leghari
Data in Brief | VOL. 31
21 May 2020

Scene Text Detection and Localization using Fully Convolutional Network
Mukhriddin Mukhiddinov
01 Nov 2019

Integrated natural scene text localization and recognition
Kakade Snehal Satwashil ... V.R Pawar
01 Apr 2017

English text localization and recognition from natural scene image
Kakade Snehal Satwashil ... V R Pawar
01 Jun 2017
