Abstract
Achieving high accuracies in recognition of handwritten text is a challenging research problem and never exhausting. The factors that instill challenges in handwritten character recognition include high degree of variability in writing, script type and the type of documents etc. In this paper, we focus on recognition of handwritten Telugu text commonly found in document images. The character set include all the vowels, consonants and single level vowel consonant clusters chosen in accordance with the commonly used terminology employed in composition pre-printed documents such as admission forms. In this paper, an algorithm is devised that performs zone-based feature extraction of the segmented character images. In the proposed work, the Gabor features are extracted from the character image at zone level and its efficiency is evaluated individually on two different zone representations and entire image at various scales and orientations. The classification and recognition performance is analyzed using nearest neighbor classifier, Naive Bayesian, multi-class SVM and probabilistic neural networks classifiers. The efficiency of the classifiers are also tested with statistical, Histogram of Gradients and Hu moments‟ feature extraction methods and the best accuracy of the system is found to be 84.8% for Gabor features with zone representation 2 and with multi-class SVM classifier.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.