Abstract

Script identification plays an important role in document image processing especially for multilingual environment. This paper hires two conventional textural methods for recognition of the scripts of the handwritten documents inscribed in different Indic scripts. The first method extracts well-known Haralick features from spatial grey-level dependence matrix (SGLDM) and the second method computes fractal dimension by using segmentation-based fractal texture analysis (SFTA). Finally, a 104-element feature vector is constructed from each page image by these two methods. The proposed technique is then evaluated on a total dataset comprising 360 handwritten document pages written in 12 Indian official scripts namely Bangla, Devanagari, Gujarati, Gurumukhi, Kannada, Malayalam, Manipuri, Oriya, Tamil, Telugu, Urdu and Roman. Experimentations using multiple classifiers reveal that multilayer perceptron (MLP) shows the highest identification accuracy of 96.94%. The encouraging outcome confirms the efficacy of customary textural features to handwritten Indic script identification.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call