Abstract

Handwriting recognition refers to recognizing a handwritten input that includes character(s) or digit(s) based on an image. Because most applications of handwriting recognition in real life contain sequential text in various languages, there is a need to develop a dynamic handwriting recognition system. Inspired by the neuroevolutionary technique, this paper proposes a Dynamically Configurable Convolutional Recurrent Neural Network (DC-CRNN) for the handwriting recognition sequence modeling task. The proposed DC-CRNN is based on the Salp Swarm Optimization Algorithm (SSA), which generates the optimal structure and hyperparameters for Convolutional Recurrent Neural Networks (CRNNs). In addition, we investigate two types of encoding techniques used to translate the output of optimization to a CRNN recognizer. Finally, we proposed a novel hybridized SSA with Late Acceptance Hill-Climbing (LAHC) to improve the exploitation process. We conducted our experiments on two well-known datasets, IAM and IFN/ENIT, which include both the Arabic and English languages. The experimental results have shown that LAHC significantly improves the SSA search process. Therefore, the proposed DC-CRNN outperforms the handcrafted CRNN methods.

Highlights

  • Handwritten text recognition was one of the first problems that artificial intelligence attempted to solve

  • This study presented a dynamically configurable Convolutional Recurrent Neural Networks (CRNNs) (DC-CRNN) for text sequence modeling

  • The CRNN’s dynamic configuration was achieved using an Salp Swarm Algorithm (SSA) hybridized with Late Acceptance Hill-Climbing (LAHC)

Read more

Summary

Introduction

Handwritten text recognition was one of the first problems that artificial intelligence attempted to solve. Many systems and mobile applications have been developed to perceive and comprehend their visual surroundings using handwritten text recognition techniques [2,3,4,5] The goal of these systems is to read street signs and allow easier automated navigation [6,7], search and index a large number of images or videos on the Internet [8,9,10], detect product labels for autonomous stores [11] or help in real-time text recognition of translations on smartphones [12]. These systems focus on deep learning techniques due to their rapid development [13]

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call