Line Segmentation Challenges in Tamil Language Palm Leaf Manuscripts

R Spurgen Ratheash*,M Mohamed Sathik

doi:10.35940/ijitee.l3159.119119

Abstract

The process of an Optical Character Recognition (OCR) for ancient hand written documents or palm leaf manuscripts is done by means of four phases. The four phases are ‘line segmentation’, ‘word segmentation’, ‘character segmentation’, and ‘character recognition’. The colour image of palm leaf manuscripts are changed into binary images by using various pre-processing methods. The first phase of an OCR might break through the hurdles of touching lines and overlapping lines. The character recognition becomes futile when the line segmentation is erroneous. In Tamil language palm leaf manuscript recognition, there are only a handful of line segmentation methods. Moreover, the available methods are not viable to meet the required standards. This article is proposed to fill the lacuna in terms of the methods necessary for line segmentation in Tamil language document analysis. The method proposed compares its efficiency with the line segmentation algorithms work on binary images such as the Adaptive Partial Projection (APP) and A* Path Planning (A*PP). The tools and criteria of evaluation metrics are measured from ICDAR 2013 Handwriting Segmentation Contest.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Line Segmentation Challenges in Tamil Language Palm Leaf Manuscripts

Abstract

Talk to us

Similar Papers

More From: International Journal of Innovative Technology and Exploring Engineering

Lead the way for us

Similar Papers

A Review of Various Line Segmentation Techniques Used in Handwritten Character Recognition
Solley Joseph ... Jossy George
-
Solley Joseph, et. al.Solley Joseph ... Jossy George
23 Jun 2022
23 Jun 2022

A performance comparison and post-processing error correction technique to OCRs for printed Tamil texts
M Ramanan ... A Ramanan
-
M Ramanan, et. al.M Ramanan ... A Ramanan
01 Dec 2014
01 Dec 2014

Image Segmentation of Historical Handwriting from Palm Leaf Manuscripts
Olarik Surinta ... Rapeeporn Chamchong
-
Olarik Surinta, et. al.Olarik Surinta ... Rapeeporn Chamchong
19 Oct 2008
19 Oct 2008

A preprocessing method for printed Tamil documents: Skew correction and textual classification
M Ramanan ... A Ramanan
-
M Ramanan, et. al.M Ramanan ... A Ramanan
01 Dec 2015
01 Dec 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Line Segmentation Challenges in Tamil Language Palm Leaf Manuscripts

Abstract

Talk to us

Similar Papers

More From: International Journal of Innovative Technology and Exploring Engineering