Abstract

This paper presents a deep learning based intelligent text recognition system with touching and overlapped characters. The robustness and effectiveness in the proposed model are enhanced through the modified configuration of neural network known as Deep Wavelet Neural Network (DWNN). The capability of deep learning networks to learn efficiently from an unlabeled dataset has attracted the attention of many researchers over the last decade. However, the performance of these networks is subject to the quality of the dataset and invariant image representation. Numerous optical character recognition techniques have also been presented in the recent years, but the overlapped and touching characters have not been addressed much. The nonlinear and uncertain representation of image data in case of overlapped text adds severe complexity in the process of feature extraction and respective learning. The proposed architecture of DWNN uses fast decaying wavelet functions as activation function in place of conventional sigmoid function to cope up with the uncertainties and nonlinearity of the data representation in overlapped text images. It comprises of cascaded layered architecture of translated and dilated versions of wavelets as activation functions for the training and feature extraction at multiple levels. The local transformation and deformation variation in the visual data has also been taken care efficiently through the modified architecture of DWNN. Comprehensive experimental analysis has been performed over various test images to verify the effectiveness of the proposed text recognition system. The performance of the proposed method is assessed with the help of the metrics, namely, estimation error, cost function and accuracy. The proposed approach will be implemented in MATLAB.

Highlights

  • The field of optical character recognition has attracted a lot of attention over the last two decades due to its capability to extract the meaningful information from the printed or handwritten text

  • This paper presents an intelligent and robust deep learning framework, Deep Wavelet Neural Network (DWNN) for the text extraction from the images with overlapped and touching characters

  • The performance of the proposed DWNN based text recognition with overlapped characters is assessed through the experimental analysis in MATLAB

Read more

Summary

Introduction

The field of optical character recognition has attracted a lot of attention over the last two decades due to its capability to extract the meaningful information from the printed or handwritten text. It has been used successfully in the applications like automatic language translation, text to speech converters, smart scanning devices, text summarization, automated postal address and ZIP code reading, bank cheque reading, etc. The intended information is extracted from the images based on a thorough analysis of the text and graphical features of the document. A typical framework of OCR involves the process of preprocessing, segmentation, feature extraction and recognition. Various techniques ranging from statistical models to deep learning framework have been proposed for the text recognition based on the characteristics of the features of documents [5,6,7,8]

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.