Abstract

In daily life, the need of automatically digitizing paper documentations and recognizing textual images is still present with existing and potential upcoming rooms for improvements, especially for languages like Arabic, which is unlike English as an instance, has more complex context and not been extensively supported by research in a such domain. As yet, the available online offline optical character recognition (OCR) systems have utilized functional techniques and achieved high performance mainly on machine printed data images. However, in case of handwritten script, the recognition task becomes highly unconstrained and much more challenging. Amongst a large verity of recognizable multi-lingual characters, handwritten digit recognition is a considerably useful task for different purposes and countless applications. In this research, the focus is on Arabic (known today as Indic or Indian) digit recognition using different proposed Gabor-based approaches in several combinations with different classification methods. The proposed approaches are trained and tested using 91120 digit samples of two independent standard databases (Arabic-Handwritten-Digits and AHDBase), allowing performance variability assessments and comparisons not only between the different combinations of features and classifiers but also between different datasets. The proposed Arabic-Indic digit recognition system achieves high recognition rates reach up to 99.87%. This research practically shows that one of the proposed approaches with significant dimensionality reduced features remains attaining a high recognition rate with low complexity time, which can be hence recommended further for online digit recognition systems.

Highlights

  • Continuous advances in the realm of hardware and software have paved the way for numerous nowadays impressive and effective technologies

  • Even handheld devices like smartphones and tablets have emerged as new platforms for plenty functional Optical character recognition (OCR) apps exploiting their built-in cameras for scanning and other hardware for required processing and operation, so they can be usable in almost any ambient environments

  • We propose two Gabor-based approaches App-1 and App-2 using different feature subset selection algorithms for Arabic-Indic handwritten digit recognition system

Read more

Summary

INTRODUCTION

Continuous advances in the realm of hardware and software have paved the way for numerous nowadays impressive and effective technologies. The focus in this research is on recognizing handwritten Arabic digits known today as Indic or Indian digits This recognition capability is very useful and can be utilized in many applications in a variety of systems and purposes. A proposed system for handwritten Arabic-Indic digit recognition in offline mode with a recommendation for a potential online counterpart for future; different proposed Gabor-based approaches using mainly two algorithmic methods for feature selection and dimensionality reduction; an extended analysis from different aspects for performance assessment and comparison between different combinations of features and classifiers applied on two independent standard large-scale databases; and an investigation for the effects of using the same proposed approaches under the change of database, enabling performance variability evaluation, validation, and comparison.

GABOR FILTERS
ARABIC-INDIC DIGIT DATABASES
Gabor-based Feature Extraction
CLASSIFICATION
EXPERIMENTS AND ANALYSES
Findings
CONCLUSIONS AND DISCUSSIONS
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.