Abstract

Offline Persian handwriting recognition is a challenging task due to the cursive nature of the Persian scripts and similarity among the Persian alphabet letters. This paper presents a Persian handwritten word recognizer based on a sequence labeling method with deep convolutional neural networks (CNN) and recurrent neural networks (RNN). In addition, a connectionist temporal classification (CTC) loss function is utilized in order to eliminate the segmentation step required in conventional methods. The CNN layers are employed to extract the sequence of features from the word image. Altogether, the RNN layer with CTC function is used for labeling the input sequence. We showed that this combination is a robust recognizer for the Persian language as it was for other fields of application such as scene text recognition. The method is tested on the popular Persian and Arabic datasets including IFN/ENIT. It also is compared with novel methods and promising results have been obtained particularly in comparison with the conventional approaches including HMM and other machine learning-based methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call