OCR FOR ENGLISH CHARACTERS BASED ON POLAR HISTOGRAM FEATURE EXTRACTION AND EUCLIDEAN DISTANCE

Saleh Ali Alshehri

doi:10.53370/001c.23733

Abstract

Optical character recognition is the process of converting characters from image format to text format. The process includes four main stages namely: pre-processing, feature extraction, character recognition, and post-processing. The success is mainly based on the feature extraction method and the character recognition algorithm. When optical character recognition systems are developed for mobile devices, two main constraints must be addressed. They are the system’s size and speed. In this research, both were considered. A small feature dataset was created using polar histogram of x and y projections of the character image. In addition, a simple character recognition algorithm based on Euclidean distance was adapted. An accuracy of over 99.9% was achieved in near microsecond scale recognizing execution time on the development system.

Highlights

Optical character recognition (OCR) is the process of character recognition where a character in image pixels format is converted to character text format such as ASCII or Unicode [1], [2], [3]
There are a huge number of features that can be used in OCR systems [6]
A dataset with 1016 fonts for each character was used for feature extraction that was based on the polar histogram of the x and y projections of character image

Summary

Introduction

Optical character recognition (OCR) is the process of character recognition where a character in image pixels format is converted to character text format such as ASCII or Unicode [1], [2], [3]. The process starts by preprocessing the text image that contains characters to prepare it in a form that can be used to extract unique features. Once the features are obtained, the recognition algorithm is chosen and implemented. In many situations, postprocessing such as output formatting may be needed These are the four main OCR system stages [4]. There are three main goals of any OCR system: speed, accuracy, and storage capacity [1]. Because of the storage and processing speed limitation, mobile devices have some constraints on the OCR applications. There are many surveys for general OCR and for language specific OCR [10], [11], [12]

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

OCR FOR ENGLISH CHARACTERS BASED ON POLAR HISTOGRAM FEATURE EXTRACTION AND EUCLIDEAN DISTANCE

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Yanbu Journal of Engineering and Science

Lead the way for us

Journal: Yanbu Journal of Engineering and Science	Publication Date: Dec 1, 2019
License type: CC BY 4.0

Similar Papers

JPEG for Arabic Handwritten Character Recognition: Add a Dimension of Application
Abdurazzag Ali ... Salem Ali
-
Abdurazzag Ali, et. al.Abdurazzag Ali ... Salem Ali
01 Oct 2008
01 Oct 2008

Soft Computing Techniques for Optical Character Recognition Systems
Arindam Chaudhuri ... Krupa Mandaviya
-
Arindam Chaudhuri, et. al.Arindam Chaudhuri ... Krupa Mandaviya
24 Dec 2016
24 Dec 2016

OCR in Indian Scripts: A Survey
Peeta Basa Pati ... A G Ramakrishnan
IETE Technical Review | VOL. 22
Peeta Basa Pati, et. al.Peeta Basa Pati ... A G Ramakrishnan
01 May 2005
IETE Technical Review | VOL. 22

An Investigative Analysis of Different LSTM Libraries for Supervised and Unsupervised Architectures of OCR Training
Syed Saqib Bukhari ... Sumam Francis
-
Syed Saqib Bukhari, et. al.Syed Saqib Bukhari ... Sumam Francis
01 Aug 2018
01 Aug 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

OCR FOR ENGLISH CHARACTERS BASED ON POLAR HISTOGRAM FEATURE EXTRACTION AND EUCLIDEAN DISTANCE

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Yanbu Journal of Engineering and Science