Abstract

A good Arabic handwritten recognition system must consider the characteristics of Arabic letters which can be explicit such as the presence of diacritics or implicit such as the baseline information (a virtual line on which cursive text are aligned and/join). In order to find an adequate method of features extraction, we have taken into consideration the nature of the Arabic characters. The paper investigate two methods based on two different visions: one describes the image in terms of the distribution of pixels, and the other describes it in terms of local patterns. Spatial Distribution of Pixels (SDP) is used according to the first vision; whereas Local Binary Patterns (LBP) are used for the second one. Tested on the Arabic portion of the Isolated Farsi Handwritten Character Database (IFHCDB) and using neural networks as a classifier, SDP achieve a recognition rate around 94% while LBP achieve a recognition rate of about 96%.

Highlights

  • Today even with the emergence of new technologies, people still use the paper as a physical medium of communication and information storage

  • Intelligent Character Recognition (ICR) systems allow the conversion of handwritten documents into electronic version, while Optical Character Recognition (OCR) systems deal with printed documents

  • The database used in this work, is the same used in the ICDAR 2009 competition [14] under the name of Isolated Farsi Handwritten Character Database (IFHCDB) [15], it contains 52380 characters and 17740 numbers

Read more

Summary

Introduction

Today even with the emergence of new technologies, people still use the paper as a physical medium of communication and information storage. There are several applications which use ICR, such as document digitization, storing, retrieving and indexing, automatic mail sorting, processing of bank checks and processing of forms. The importance of these applications has leads to intensive research for several years. Three different letters ‘ba’,’taa’,’thaa’ (‫ب‬, ‫ت‬, ‫)ث‬ have the same basic shape but the position and the number of points are different (fig.2) All these facts make the Arabic script more challenging than other script like Latin. The rest of the paper is organized as follows: The section, examines some related works on isolated Arabic character recognition. International Journal of Interactive Multimedia and Artificial Intelligence, Vol 4, No4

Related Works
Dataset
Preprocessing
Classification
Feature extraction
3) Summary
Local Binary Pattern
Comparative Analysis
Findings
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.