Abstract
Handwritten word recognition is an active research area due to numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words makes them more difficult to recognize. In current methods, discriminative features are manually extracted from images by humans so their performance depends on human creativity. This process is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning, is employed to automatically extract the discriminative features. Deep learning is able to discover complex structure (discriminative feature here) in large datasets. First in the proposed method, a preprocessing algorithm converts the images to equal size while maintaining handwritten words structure. Then, the images are given to two different architectures of CNNs, AlexNet and GoogLeNet with and without batch normalization. Finally, the proposed method is evaluated on “IRANSHAHR” dataset which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.