Abstract
As the recognition rates and speeds of optical character recognition (OCR) systems steadily improve, the problem of OCR--and subsequently research interest--is shifting from recognizing: isolated, high-quality characters to reading cursive scripts and degraded documents. In recognizing such texts, a major undertaking is segmenting cursive words into characters and isolating merged characters. In OCR systems that recognize cursive text, the segmentation subsystem becomes the pivotal stage in the system to which a sizable portion of processing is devoted and a considerable share of recognition errors is attributed. The most notable feature of Arabic writing is its cursiveness. It also poses the most difficult problem for recognition algorithms. In this work, we describe the design and implementation of a system that is automatically trainable and that recognizes noisy and cursive words. To recognize a word, the system does not segment it into symbols (character shapes) in advance; rather, it recognizes the input word by detecting a set of shape primitives on the word. It then matches the regions of the word (represented by the detected primitives) to a set of symbol models. A spatial arrangement of symbol models that are matched to regions of the word, then, becomes the description of the recognized word. Since the number of potential arrangements of all symbol models is large, the system imposes a set of word structure and spatial consistency. It searches the space comprised of the arrangements that satisfy the constraints and tries to maximize the a posteriori probability of the symbol-models' arrangement. Large-scale experimentation with the system on isolated characters reveals that it has a recognition rate of 99.7% for synthetically degraded symbols and 94.1% for scanned symbols. Experimentation on isolated words reveals that the system has a recognition rate of 99.4% for noise-free words, 95.6% for synthetically degraded words, and 73% for scanned words. The main theoretical contribution of this work is in laying the foundation for a segmentation-free approach for Arabic word recognition. Recognition is based on maximizing the probability of the word given the detected primitives. The system is designed to minimize training effort and is extensible as training determines the symbols the system recognizes.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal on Document Analysis and Recognition
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.