Few shots are all you need: A progressive learning approach for low resource handwritten text recognition

Mohamed Ali Souibgui,Yousri Kessentini,Beáta Megyesi,Alicia Fornés

doi:10.1016/j.patrec.2022.06.003

Abstract

Handwritten text recognition in low resource scenarios, such as manuscripts with rare alphabets, is a challenging problem. The main difficulty comes from the very few annotated data and the limited linguistic information (e.g. dictionaries and language models). Thus, we propose a few-shot learning-based handwriting recognition approach that significantly reduces the human labor annotation process, requiring only few images of each alphabet symbol. The method consists in detecting all the symbols of a given alphabet in a textline image and decoding the obtained similarity scores to the final sequence of transcribed symbols. Our model is first pretrained on synthetic line images generated from any alphabet, even though different from the target domain. A second training step is then applied to diminish the gap between the source and target data. Since this retraining would require annotation of thousands of handwritten symbols together with their bounding boxes, we propose to avoid such human effort through an unsupervised progressive learning approach that automatically assigns pseudo-labels to the non-annotated data. The evaluation on different manuscript datasets show that our model can lead to competitive results with a significant reduction in human effort. The code will be publicly available in this repository: \url{https://github.com/dali92002/HTRbyMatching}

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Pattern Recognition Letters	Publication Date: Aug 1, 2022
Citations: 18	License type: cc-by

R Discovery Prime

R Discovery Prime

Few shots are all you need: A progressive learning approach for low resource handwritten text recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Similar Papers

Crew scheduling of light rail transit in Hong Kong: from modeling to implementation
Sydney C.K Chu ... Edmond C.H Chan
Computers & Operations Research | VOL. 25
Sydney C.K Chu, et. al.Sydney C.K Chu ... Edmond C.H Chan
01 Nov 1998
Computers & Operations Research | VOL. 25

Can Pretrained English Language Models Benefit Non-English NLP Systems in Low-Resource Scenarios?
Zewen Chi ... Heyan Huang
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 32
Zewen Chi, et. al.Zewen Chi ... Heyan Huang
01 Jan 2024
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 32

Triple Steps for Verifying Chemical Reaction Based on Deep Whale Optimization Algorithm (VCR-WOA)
Samaher Al-Janabi ... Ayad Alkaim
-
Samaher Al-Janabi, et. al.Samaher Al-Janabi ... Ayad Alkaim
26 May 2022
26 May 2022

Multi Modal 2-D Canvas Based Gallery Content Retrieval
Pragya Paramita Sahu ... Viswanath Veera
-
Pragya Paramita Sahu, et. al.Pragya Paramita Sahu ... Viswanath Veera
01 Jan 2023
01 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Few shots are all you need: A progressive learning approach for low resource handwritten text recognition

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters