MSdocTr-Lite: A lite transformer for full page multi-script handwriting recognition

Marwa Dhiaf,Ahmed Cheikh Rouhou,Yousri Kessentini,Sinda Ben Salem

doi:10.1016/j.patrec.2023.03.020

Abstract

The Transformer has quickly become the dominant architecture for various pattern recognition tasks due to its capacity for long-range representation. However, transformers are data-hungry models and need large datasets for training. In Handwritten Text Recognition (HTR), collecting a massive amount of labeled data is a complicated and expensive task. In this paper, we propose a lite transformer architecture for full-page multi-script handwriting recognition. The proposed model comes with three advantages: First, to solve the common problem of data scarcity, we propose a lite transformer model that can be trained on a reasonable amount of data, which is the case of most HTR public datasets, without the need for external data. Second, it can learn the reading order at page-level thanks to a curriculum learning strategy, allowing it to avoid line segmentation errors, exploit a larger context and reduce the need for costly segmentation annotations. Third, it can be easily adapted to other scripts by applying a simple transfer-learning process using only page-level labeled images. Extensive experiments on different datasets with different scripts (French, English, Spanish, and Arabic) show the effectiveness of the proposed model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MSdocTr-Lite: A lite transformer for full page multi-script handwriting recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Mar 24, 2023
Citations: 6

Similar Papers

Semi-supervised learning for cursive handwriting recognition using keyword spotting
V Frinken ... A Fischer
-
V Frinken, et. al.V Frinken ... A Fischer
01 Sep 2012
01 Sep 2012

A Comparative Study on Efficiency of Classification Techniques with Zone Level Gabor Features towards Handwritten Telugu Character Recognition
N Shobha ... Vasudev T
International Journal of Computer Applications | VOL. 148
N Shobha, et. al.N Shobha ... Vasudev T
16 Aug 2016
International Journal of Computer Applications | VOL. 148

Impact of Deep Learning on Localizing and Recognizing Handwritten Text in Lecture Videos
Lakshmi Haritha Medida ... Kasarapu Ramani
International Journal of Advanced Computer Science and Applications | VOL. 12
Lakshmi Haritha Medida, et. al.Lakshmi Haritha Medida ... Kasarapu Ramani
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 12

Bernoulli HMMs for Handwritten Text Recognition
Adrián Giménez Pastor
-
Adrián Giménez PastorAdrián Giménez Pastor
09 Jun 2014
09 Jun 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MSdocTr-Lite: A lite transformer for full page multi-script handwriting recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters