LAL: Linguistically Aware Learning for Scene Text Recognition

Yi Zheng,Wenda Qin,Derry Wijaya,Margrit Betke

doi:10.1145/3394171.3413913

Abstract

Scene text recognition is the task of recognizing character sequences in images of natural scenes. The considerable diversity in the appearance of text in a scene image and potentially highly complex backgrounds make text recognition challenging. Previous approaches employ character sequence generators to analyze text regions and, subsequently, compare the candidate character sequences against a language model. In this work, we propose a bimodal framework that simultaneously utilizes visual and linguistic information to enhance recognition performance. Our linguistically aware learning (LAL) method effectively learns visual embeddings using a rectifier, encoder, and attention decoder approach, and linguistic embeddings, using a deep next-character prediction model. We present an innovative way of combining these two embeddings effectively. Our experiments on eight standard benchmarks show that our method outperforms previous methods by large margins, particularly on rotated, foreshortened, and curved text. We show that the bimodal approach has a statistically significant impact. We also contribute a new dataset, and show robust performance when LAL is combined with a text detector in a pipelined text spotting framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LAL: Linguistically Aware Learning for Scene Text Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images.
Asghar Ali Chandio ... Mehwish Leghari
Data in Brief | VOL. 31
Asghar Ali Chandio, et. al.Asghar Ali Chandio ... Mehwish Leghari
21 May 2020
Data in Brief | VOL. 31

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition
Christian Bartz ... Haojin Yang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 32
Christian Bartz, et. al.Christian Bartz ... Haojin Yang
27 Apr 2018
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 32

Automatic localization and recognition of perspectively distorted text in natural scene images
Annmaria Cherian ... Sanju Sebastian
-
Annmaria Cherian, et. al.Annmaria Cherian ... Sanju Sebastian
01 Feb 2016
01 Feb 2016

Scene text detection and recognition with advances in deep learning: a survey
Xiyan Liu ... Chunhong Pan
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 22
Xiyan Liu, et. al.Xiyan Liu ... Chunhong Pan
27 Mar 2019
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LAL: Linguistically Aware Learning for Scene Text Recognition

Abstract

Talk to us

Similar Papers