The Optical Character Recognition for Cursive Script Using HMM: A Review

Saeeda Naz,Muhammad M Ajmal,Arif I Umar,Salahuddin Salahuddin,Syed H Shirazi

doi:10.19026/rjaset.8.1193

Saeeda Naz, Muhammad M Ajmal + Show 3 more

Open Access

https://doi.org/10.19026/rjaset.8.1193

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text
Similar Papers

Abstract

Listen

Automatic Character Recognition has wide variety of applications such as automatic postal mail sorting, number plate recognition and automatic form of reader and entering text from PDA's etc. Cursive script's Automatic Character Recognition is a complex process facing unique issues unlike other scripts. Many solutions have been proposed in the literature to solve complexities of cursive scripts character recognition. This paper present a comprehensive literature review of the Optical Character Recognition (OCR) for off-line and on-line character recognition for Urdu, Arabic and Persian languages, based on Hidden Markov Model (HMM). We surveyed all most all significant approaches proposed and concluded future directions of OCR for cursive languages.

Highlights

Optical Character Recognition (OCR) converts text images into text file
We describe the predominant application of Hidden Markov Model (HMM) given by segmentation-free and segmentation based recognition of cursive script for off-line and on-line handwritten ligatures, words or text lines
The proposed system was assessed on the Urdu single character ligatures and attained 98% accuracy rate for the manually generated data and a 96% accuracy rate for the data scanned from several books and magazines

Summary

INTRODUCTION

Optical Character Recognition (OCR) converts text images into text file. The main objective of OCR is to mimic the reading ability of human being with accuracy and high speed. The cursive nature and forms of letter depending on its position to create words are creating challenges for researcher in the segmentation stage of the character recognition. Urdu language unlike Arabic language has some peculiarities due to more alphabet/letters and some unique properties These peculiarities make OCR in Urdu language more complex and challenging. The Nasta’liq font style adds further to challenges because language is written diagonally with no fixed baseline, no standards for slopes, context sensitivity caused by filled or false loops and character/ligature overlaps (Slimane et al, 2012). Diagonality is introduced by Nasta’liq writing style in Urdu that makes this language more complex for researcher in the field of OCR. The intra-ligature and inter ligature overlapping in Urdu text being Arabic based text add to challenges in the segmentation and recognition (Naz 2013). This technique is not applicable to Nasta'liq, where the ligatures overlap in horizontal/vertical projections and display minor spacing among the lines

HMM BASED CURSIVE SCRIPT CHARACTER RECOGNITION

Statistical features

ADAB database

Findings

CONCLUSION

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Research Journal of Applied Sciences, Engineering and Technology	Publication Date: Nov 20, 2014
Citations: 4	License type: cc-by

R Discovery Prime

The Optical Character Recognition for Cursive Script Using HMM: A Review

Abstract

Highlights

Summary

Published Version

Talk to us

Similar Papers

More From: Research Journal of Applied Sciences, Engineering and Technology

Lead the way for us

Similar Papers

Recognition of Printed Text Based on Hidden Markov Model
Ghaydaa Al-Talib ... Armanesa Hasson
AL-Rafidain Journal of Computer Sciences and Mathematics | VOL. 7
Ghaydaa Al-Talib, et. al.Ghaydaa Al-Talib ... Armanesa Hasson
01 Dec 2010
AL-Rafidain Journal of Computer Sciences and Mathematics | VOL. 7

Fundamentals in Handwriting Recognition
-
-
--
01 Jan 1993
01 Jan 1993

Scale and rotation invariant OCR for Pashto cursive script using MDLSTM network
Riaz Ahmad ... Sheikh Faisal Rashid
-
Riaz Ahmad, et. al.Riaz Ahmad ... Sheikh Faisal Rashid
01 Aug 2015
01 Aug 2015

Implementation of number plate detection system for vehicle registration using IOT and recognition using CNN
M.A Jawale ... Nikhil Marriwala
Measurement: Sensors | VOL. 27
M.A Jawale, et. al.M.A Jawale ... Nikhil Marriwala
02 May 2023
Measurement: Sensors | VOL. 27

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

The Optical Character Recognition for Cursive Script Using HMM: A Review

Abstract

Highlights

Summary

Published Version

Talk to us

Similar Papers

More From: Research Journal of Applied Sciences, Engineering and Technology