Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement

Mohamed Ali Souibgui,Dimosthenis Karatzas,Andres Mafla,Ali Furkan Biten,Josep Lladós,Lluis Gomez,Alicia Fornés,Yousri Kessentini,Sanket Biswas

doi:10.1609/aaai.v37i2.25328

Abstract

In this paper, we propose a Text-Degradation Invariant Auto Encoder (Text-DIAE), a self-supervised model designed to tackle two tasks, text recognition (handwritten or scene-text) and document image enhancement. We start by employing a transformer-based architecture that incorporates three pretext tasks as learning objectives to be optimized during pre-training without the usage of labelled data. Each of the pretext objectives is specifically tailored for the final downstream tasks. We conduct several ablation experiments that confirm the design choice of the selected pretext tasks. Importantly, the proposed model does not exhibit limitations of previous state-of-the-art methods based on contrastive losses, while at the same time requiring substantially fewer data samples to converge. Finally, we demonstrate that our method surpasses the state-of-the-art in existing supervised and self-supervised settings in handwritten and scene text recognition and document image enhancement. Our code and trained models will be made publicly available at https://github.com/dali92002/SSL-OCR

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 5

Similar Papers

Scene text detection and recognition with advances in deep learning: a survey
Xiyan Liu ... Chunhong Pan
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 22
Xiyan Liu, et. al.Xiyan Liu ... Chunhong Pan
27 Mar 2019
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 22

Research on Text Recognition of Natural Scenes for Complex Situations
Wenhua Yu ... Askar Hamdulla
-
Wenhua Yu, et. al.Wenhua Yu ... Askar Hamdulla
22 Jul 2022
22 Jul 2022

Occluded Text Detection and Recognition in the Wild
Zobeir Raisi ... John Zelek
-
Zobeir Raisi, et. al.Zobeir Raisi ... John Zelek
01 May 2022
01 May 2022

STV2k
Pingping Xiao ... Hanzi Wang
-
Pingping Xiao, et. al.Pingping Xiao ... Hanzi Wang
19 Aug 2016
19 Aug 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence