Abstract
Handwriting recognition refers to recognizing a handwritten input that includes character(s) or digit(s) based on an image. Because most applications of handwriting recognition in real life contain sequential text in various languages, there is a need to develop a dynamic handwriting recognition system. Inspired by the neuroevolutionary technique, this paper proposes a Dynamically Configurable Convolutional Recurrent Neural Network (DC-CRNN) for the handwriting recognition sequence modeling task. The proposed DC-CRNN is based on the Salp Swarm Optimization Algorithm (SSA), which generates the optimal structure and hyperparameters for Convolutional Recurrent Neural Networks (CRNNs). In addition, we investigate two types of encoding techniques used to translate the output of optimization to a CRNN recognizer. Finally, we proposed a novel hybridized SSA with Late Acceptance Hill-Climbing (LAHC) to improve the exploitation process. We conducted our experiments on two well-known datasets, IAM and IFN/ENIT, which include both the Arabic and English languages. The experimental results have shown that LAHC significantly improves the SSA search process. Therefore, the proposed DC-CRNN outperforms the handcrafted CRNN methods.
Highlights
Handwritten text recognition was one of the first problems that artificial intelligence attempted to solve
This study presented a dynamically configurable Convolutional Recurrent Neural Networks (CRNNs) (DC-CRNN) for text sequence modeling
The CRNN’s dynamic configuration was achieved using an Salp Swarm Algorithm (SSA) hybridized with Late Acceptance Hill-Climbing (LAHC)
Summary
Handwritten text recognition was one of the first problems that artificial intelligence attempted to solve. Many systems and mobile applications have been developed to perceive and comprehend their visual surroundings using handwritten text recognition techniques [2,3,4,5] The goal of these systems is to read street signs and allow easier automated navigation [6,7], search and index a large number of images or videos on the Internet [8,9,10], detect product labels for autonomous stores [11] or help in real-time text recognition of translations on smartphones [12]. These systems focus on deep learning techniques due to their rapid development [13]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.