Abstract

Numeral recognition is treated as a benchmark research problem as this is a basic module for designing a comprehensive optical character recognition system. In this context, unconstrained handwritten numeral recognition is still considered as an open research problem. Most of the feature descriptors found in the literature for the said problem, work well for numeral images written in a particular language. To encounter this shortcoming, in this paper, we have proposed two shape-based feature descriptors, namely Point-Light Source-based Shadow (PLSS) and Histogram of Oriented Pixel Positions (HOPP). We have evaluated the proposed feature descriptors on 10 (9 offline and 1 online) publicly available standard handwritten numeral image datasets written in eight different languages. Besides, to prove the usefulness of the descriptors in real-life scenario, we have considered numeral string images also. We have also shown how the proposed feature descriptors are invariant toward broken, noisy and rotated numeral images. Experimental outcomes soundly prove that the proposed feature descriptors have the ability to estimate the shape of a numeral image almost accurately irrespective of the language in which it is written. Comparison of the proposed feature descriptors with other shape-based as well as texture-based features shows that PLSS and HOPP produce the results which are analogous to state of the art. The code of the proposed feature descriptors can be found at— https://github.com/ghoshsoulib/Numeral-Recognition .

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call