Deep Learning Analysis in Development of Handwritten and Plain Text Classification API

Danny Gani,Maria Lamury,Kho I Eng,Maulahikmah Galinium,James Purnama

doi:10.1145/3557738.3557852

Abstract

Optical Character Recognition (OCR) and Handwritten Text Recognition (HTR) are technologies that enable text recognition. The difference between OCR and HTR is one designed specifically for digital text and one designed for handwritten text. There are already various implementations of OCR and HTR online. However, such systems do not guarantee the systems are in premises. To solve this problem, the OCR and HTR system must be built from the scratch. The purpose of this research is to improve the recognition by separating the text whether it is a handwritten or a printed text, which will later be forwarded into the appropriate recognition system. An application program interface (API) was also created in order to finalize the classification system into real world usage. In this research, the classification system being developed using convolutional neural network (CNN) method. To be able to reach the highest accuracy of the classification system, the experimentation and improvement about hyperparameters, dataset format, data augmentation and analysis on 3 CNN architectures were conducted. In the end of this research, there are 2 architectures in a tight competition, one is VGG-16 with 90.63% accuracy and one is AlexNet with 90.17% accuracy on ideal data testing. However, AlexNet is chosen as the winner after the testing with real data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Learning Analysis in Development of Handwritten and Plain Text Classification API

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Data Augmentation for Offline Handwritten Text Recognition: A Systematic Literature Review
Arthur Flor De Sousa Neto ... Alejandro Héctor Toselli
SN Computer Science | VOL. 5
Arthur Flor De Sousa Neto, et. al.Arthur Flor De Sousa Neto ... Alejandro Héctor Toselli
10 Feb 2024
SN Computer Science | VOL. 5

Study of the influence of lexicon and language restrictions on computer assisted transcription of historical manuscripts
Emilio Granell ... Carlos-D Martínez-Hinarejos
Neurocomputing | VOL. 390
Emilio Granell, et. al.Emilio Granell ... Carlos-D Martínez-Hinarejos
23 Jan 2020
Neurocomputing | VOL. 390

Machine Learning Tensor Flow Based Platform for Recognition of Hand Written Text
Nitin Gupta ... Neha Goyal
-
Nitin Gupta, et. al.Nitin Gupta ... Neha Goyal
27 Jan 2021
27 Jan 2021

Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos
Lakshmi Haritha Medida ... Kasarapu Ramani
International Journal of Advanced Computer Science and Applications | VOL. 12
Lakshmi Haritha Medida, et. al.Lakshmi Haritha Medida ... Kasarapu Ramani
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Learning Analysis in Development of Handwritten and Plain Text Classification API

Abstract

Talk to us

Similar Papers