Abstract

Script recognition is a necessary preliminary step for text recognition. In the deep learning era, this task has two essential requirements: a large labeled dataset for training and sufficient computational resources to train the models. When either is limited, alternative methods are needed. This motivates transfer learning, in which knowledge from a model previously trained on a benchmark dataset is reused on a smaller dataset for a different task, saving computational power because only a fraction of the model's parameters need to be trained. Here we study two pre-trained models and fine-tune them for script classification tasks. First, a pre-trained VGG-16 model is fine-tuned on the publicly available CVSI-15 and MLe2e datasets for script recognition. Second, a model that performs well on the Devanagari handwritten characters dataset is adopted and fine-tuned on the Kaggle Devanagari numeral dataset for numeral recognition. The performance of the proposed fine-tuned models depends on whether the target dataset is similar to or dissimilar from the original dataset, and it is analyzed with widely used optimizers.

Highlights

  • Script identification in documents and scene images is an essential starting point for text recognition under multi-lingual scenarios

  • Two pre-trained models are used: the first is VGG-16 trained on the ImageNet dataset, and the second is trained on the Devanagari handwritten characters dataset (DHCD)

  • The first model is fine-tuned on the Competition on Video Script Identification (CVSI-15) and MLe2e datasets, which differ from the original dataset, for script classification tasks


Summary

Introduction

Script identification in documents and scene images is an essential starting point for text recognition under multi-lingual scenarios. Deep learning models require high computational power and a very large dataset for training; when computational resources are limited and only a small dataset is available, the trained model performs poorly on real-world test data. To overcome this issue, we can reuse the weights of a model that was well trained on a very large benchmark dataset such as ImageNet, which contains millions of images for the classification task. This motivates the exploration of transfer learning, where the knowledge of a pre-trained model is applied to another dataset for another task, saving computational power and removing the need for a very large training dataset. The pre-trained models used are described in the following sections.

(Figure: fine-tuning architecture, with VGG block 1 frozen.)
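As a hedged illustration of this strategy, the sketch below loads an ImageNet-pretrained VGG-16 in Keras, freezes its first convolutional block (the figure fragment above indicates block 1 is frozen), and attaches a new classification head for a target script dataset. The input size, head layers, optimizer, and 10-class output (e.g., the ten scripts of CVSI-15) are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal sketch of fine-tuning an ImageNet-pretrained VGG-16 for script
# classification. Frozen blocks, input size, head layers, optimizer, and
# class count are assumptions for illustration only.
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_SCRIPTS = 10  # assumption: e.g., the 10 script classes of CVSI-15

# VGG-16 convolutional base with ImageNet weights, without the top classifier.
base = tf.keras.applications.VGG16(weights="imagenet", include_top=False,
                                   input_shape=(224, 224, 3))

# Freeze the first convolutional block; later blocks remain trainable.
for layer in base.layers:
    if layer.name.startswith("block1"):
        layer.trainable = False

# New classification head trained on the target script dataset.
model = models.Sequential([
    base,
    layers.GlobalAveragePooling2D(),
    layers.Dense(256, activation="relu"),
    layers.Dropout(0.5),
    layers.Dense(NUM_SCRIPTS, activation="softmax"),
])

# The abstract compares widely used optimizers; Adam is one common choice.
model.compile(optimizer=tf.keras.optimizers.Adam(1e-4),
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```

With data pipelines for CVSI-15 or MLe2e prepared, `model.fit` would then update only the unfrozen convolutional blocks and the new head, which is the parameter saving the abstract refers to.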

Pre-Trained VGG-16 Model
Pre-Trained Model on DHCD
Method
Findings
Conclusions and Future Work
