Deep optical character recognition: a case of Pashto language

Shizza Zahoor,Naila H Khan,Saeeda Naz,Muhammad I Razzak

doi:10.1117/1.jei.29.2.023002

Abstract

Over the past decades, text recognition technologies have focused immensely on noncursive isolated scripts. A text recognition system for the cursive Pashto script will serve as a great contribution, allowing the traditional, cultural, and educational Pashto literature to be converted into machine-readable form. We propose the use of deep learning architectures based on the transfer learning for the recognition of Pashto ligatures. For recognition analysis and evaluation, the ligature images in the dataset are preprocessed by data augmentation techniques, i.e., negatives, contours, and rotated to increase the variation of each sample and size of the original dataset. Rich feature representations are automatically extracted from the Pashto ligature images using deep convolution layers of the convolution neural network (CNN) architectures using fine-tuned approach. Pretrained CNN architectures: AlexNet, GoogleNet, and VGG (VGG-16 and VGG-19) are used for classification by feeding the extracted features to a fully connected layer and a softmax layer. The proposed deep transfer-based learning has achieved phenomenal recognition rates for Pashto ligatures on benchmark FAST-NU Pashto dataset. An accuracy of 97.24%, 97.46%, and 99.03% is achieved using AlexNext, GoogleNet, and VGGNet architectures, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep optical character recognition: a case of Pashto language

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging

Lead the way for us

Journal: Journal of Electronic Imaging	Publication Date: Mar 4, 2020
Citations: 6

Similar Papers

Human Activity Recognition in a Realistic and Multiview Environment Based on Two-Dimensional Convolutional Neural Network
Ashish Khare ... Arati Kushwaha
Journal of Artificial Intelligence and Technology | VOL. -
Ashish Khare, et. al. Ashish Khare ... Arati Kushwaha
09 May 2023
Journal of Artificial Intelligence and Technology | VOL. -

Texture Patterns for Object Recognition and Content-Based Color Image Retrieval

-

21 Dec 2020
21 Dec 2020

Convolutional Neural Network for Machine-Printed Traditional Mongolian Font Recognition
Hongxi Wei ... Ya Wen
-
Hongxi Wei, et. al.Hongxi Wei ... Ya Wen
01 Jan 2018
01 Jan 2018

A novel study for automatic two-class COVID-19 diagnosis (between COVID-19 and Healthy, Pneumonia) on X-ray images using texture analysis and 2-D/3-D convolutional neural networks.
Huseyin Yaşar ... Murat Ceylan
Multimedia systems | VOL. 37
Huseyin Yaşar, et. al.Huseyin Yaşar ... Murat Ceylan
29 Jan 2022
Multimedia systems | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep optical character recognition: a case of Pashto language

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging