Deep Extreme Learning Machine-Based Optical Character Recognition System for Nastalique Urdu-Like Script Languages

Syed Saqib Raza Rizvi,Sagheer Abbas,Nida Anwer,Muhammad Adnan Khan,Muhammad Asadullah,Areej Fatima

doi:10.1093/comjnl/bxaa042

Abstract

Abstract Optical character recognition systems convert printed or handwritten scripts into digital text formats like ASCII or UNICODE. Urdu-like script languages like Urdu, Punjabi and Sindhi are widely spoken languages of the world, especially in Asia. An enormous amount of printed and handwritten text of such languages exist, which needs to be converted into computer-understandable formats for knowledge extraction. In this study, extreme learning machine’s (ELM’s) most recently proposed variant called deep extreme learning machine (DELM)-based optical character recognition (OCR) system is proposed to enhance Urdu-like script language’s character recognition rate. The proposed DELM-based character recognition model is optimizing the OCR process by reducing the overhead of Pre-processing, Segmentation and Feature Extraction Layer. The proposed system evaluations accomplished 98.75% training accuracy with 1.492 × 10−3 RMSE and 98.12% testing accuracy with 1.587 × 10−3 RMSE, with six DELM hidden layers. The results show that the proposed system has attained the foremost recognition rate as compared to any previously proposed Urdu-like script language OCR system. This technique is applicable for machine-printed text and fractionally useful for handwritten text as well. This study will aid in the advancement of more accurate Urdu-like script OCR’s software systems in the future.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Extreme Learning Machine-Based Optical Character Recognition System for Nastalique Urdu-Like Script Languages

Abstract

Talk to us

Similar Papers

More From: The Computer Journal

Lead the way for us

Journal: The Computer Journal	Publication Date: Jun 20, 2020
Citations: 10

Similar Papers

Optical Character Recognition System for Nastalique Urdu-Like Script Languages Using Supervised Learning
S S R Rizvi ... A Sagheer
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 33
S S R Rizvi, et. al.S S R Rizvi ... A Sagheer
01 Sep 2019
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 33

OmniPage vs. Sakhr: paired model evaluation of two Arabic OCR products
Tapas Kanungo ... Daniel P Lopresti
-
Tapas Kanungo, et. al.Tapas Kanungo ... Daniel P Lopresti
07 Jan 1999
07 Jan 1999

Soft Computing Techniques for Optical Character Recognition Systems
Arindam Chaudhuri ... Soumya K Ghosh
-
Arindam Chaudhuri, et. al.Arindam Chaudhuri ... Soumya K Ghosh
24 Dec 2016
24 Dec 2016

JPEG for Arabic Handwritten Character Recognition: Add a Dimension of Application
Abdurazzag Ali ... Salem Ali
-
Abdurazzag Ali, et. al.Abdurazzag Ali ... Salem Ali
01 Oct 2008
01 Oct 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Extreme Learning Machine-Based Optical Character Recognition System for Nastalique Urdu-Like Script Languages

Abstract

Talk to us

Similar Papers

More From: The Computer Journal