Determination of optimal features database for OCR of printed Telugu text

C Vasantha Lakshmi,C Patvardhan,Sarika Singh

doi:10.1109/natsys.2015.7489112

Abstract

OCR (Optical Character Recognition) systems are being developed due to their numerous applications even for Indian scripts like Telugu which are complicated due to the usage of a large number of symbols. OCR systems typically store pre-computed features of symbols to be recognized in a database. Recognition of an unknown symbol is performed by finding the symbol in the database that is nearest in features space. Design of an appropriate database is, therefore, a critical step. This is especially so when the OCR system targets recognition of numerous symbols in multiple fonts and sizes. The idea is to develop an OCR system that has small recognition times and high recognition accuracies. The naive approach of putting features of all symbols in all fonts and sizes in the database might be counterproductive on both counts. Experimental results on text document images with multiple fonts and sizes show that the strategy for database design for OCR of printed Telugu text proposed in this paper achieves both the objectives. This is the first reported approach for such a database design for Telugu OCR.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Determination of optimal features database for OCR of printed Telugu text

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Soft Computing Techniques for Optical Character Recognition Systems
Arindam Chaudhuri ... Pratixa Badelia
-
Arindam Chaudhuri, et. al.Arindam Chaudhuri ... Pratixa Badelia
24 Dec 2016
24 Dec 2016

JPEG for Arabic Handwritten Character Recognition: Add a Dimension of Application
Abdurazzag Ali ... Salem Ali
-
Abdurazzag Ali, et. al.Abdurazzag Ali ... Salem Ali
01 Oct 2008
01 Oct 2008

OmniPage vs. Sakhr: paired model evaluation of two Arabic OCR products
Tapas Kanungo ... Daniel P Lopresti
-
Tapas Kanungo, et. al.Tapas Kanungo ... Daniel P Lopresti
07 Jan 1999
07 Jan 1999

OCR in Indian Scripts: A Survey
Peeta Basa Pati ... A G Ramakrishnan
IETE Technical Review | VOL. 22
Peeta Basa Pati, et. al.Peeta Basa Pati ... A G Ramakrishnan
01 May 2005
IETE Technical Review | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Determination of optimal features database for OCR of printed Telugu text

Abstract

Talk to us

Similar Papers