Abstract

Automatic Arabic handwritten recognition is one of the recently studied problems in the field of Machine Learning. Unlike Latin languages, Arabic is a Semitic language that forms a harder challenge, especially with the variability of patterns caused by factors such as the writer’s age. Most studies have focused on adults, with only one recent study on children. Moreover, many recent machine-learning methods have focused on using Convolutional Neural Networks (CNNs), a powerful class of neural networks that can extract complex features from images. In this paper, we propose a convolutional neural network (CNN) model that recognizes children’s handwriting with an accuracy of 91% on the Hijja dataset, a recent dataset built by collecting images of Arabic characters written by children, and 97% on the Arabic Handwritten Character Dataset. The results showed a good improvement over the proposed model from the Hijja dataset authors, yet it revealed a bigger challenge to solve for children’s Arabic handwritten character recognition. Moreover, we proposed a new approach using multiple models instead of a single model based on the number of strokes in a character and merged Hijja with AHCD, which achieved an average prediction accuracy of 96%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call