Recognition of Devanagari Scene Text Using Autoencoder CNN

Sankirti Sandeep Shiravale,Sanjeev S Sannakki,Jayadevan R

doi:10.5565/rev/elcvia.1344

Sankirti Sandeep Shiravale, Sanjeev S Sannakki + Show 1 more

Open Access

https://doi.org/10.5565/rev/elcvia.1344

Copy DOI

Abstract

Scene text recognition is a well-rooted research domain covering a diverse application area. Recognition of scene text is challenging due to the complex nature of scene images. Various structural characteristics of the script also influence the recognition process. Text and background segmentation is a mandatory step in the scene text recognition process. A text recognition system produces the most accurate results if the structural and contextual information is preserved by the segmentation technique. Therefore, an attempt is made here to develop a robust foreground/background segmentation(separation) technique that produces the highest recognition results. A ground-truth dataset containing Devanagari scene text images is prepared for the experimentation. An encoder-decoder convolutional neural network model is used for text/background segmentation. The model is trained with Devanagari scene text images for pixel-wise classification of text and background. The segmented text is then recognized using an existing OCR engine (Tesseract). The word and character level recognition rates are computed and compared with other existing segmentation techniques to establish the effectiveness of the proposed technique.

Full Text