Offline Handwritten Script Identification in Document Images

Mallikarjun Hangarge,B.V Dhandra

doi:10.5120/834-1170

Abstract

Automatic handwritten script identification from document images facilitates many important applications such as sorting, transcription of multilingual documents and indexing of large collection of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate a texture as a tool for determining the script of handwritten document image, based on the observation that text has a distinct visual texture. Further, K nearest neighbour algorithm is used to classify 300 text blocks as well as 400 text lines into one of the three major Indian scripts: English, Devnagari and Urdu, based on 13 spatial spread features extracted using morphological filters. The proposed algorithm attains average classification accuracy as high as 99.2% for bi-script and 88.6% for tri-script separation at text line and text block level respectively with five fold cross validation test. General Terms Pattern Recognition, Document Image Analysis

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Offline Handwritten Script Identification in Document Images

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Applications

Lead the way for us

Journal: International Journal of Computer Applications	Publication Date: Jul 10, 2010
Citations: 45

Similar Papers

Simulation of Quantum Cellular Automata Circuits Using Neural Networks
E.N Ganesh ... M.J.S Rangachar
-
E.N Ganesh, et. al.E.N Ganesh ... M.J.S Rangachar
01 Dec 2007
01 Dec 2007

Segmentation of Handwritten Document Images into Text Lines
Vassilis Katsouros ... Vassilis Papavassiliou
-
Vassilis Katsouros, et. al.Vassilis Katsouros ... Vassilis Papavassiliou
19 Apr 2011
19 Apr 2011

Neural Networks for Document Image and Text Processing
Joan Pastor Pellicer
-
Joan Pastor PellicerJoan Pastor Pellicer
03 Nov 2017
03 Nov 2017

Spotting Separator Points at Line Terminals in Compressed Document Images for Text-line Segmentation
Amarnath R ... P Nagabhushan
International Journal of Computer Applications | VOL. 172
Amarnath R, et. al.Amarnath R ... P Nagabhushan
17 Aug 2017
International Journal of Computer Applications | VOL. 172

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Offline Handwritten Script Identification in Document Images

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Applications