Improving patch-based scene text script identification with ensembles of conjoined networks

Lluis Gomez,Anguelos Nicolaou,Dimosthenis Karatzas

doi:10.1016/j.patcog.2017.01.032

Abstract

This paper focuses on the problem of script identification in scene text images. Facing this problem with state of the art CNN classifiers is not straightforward, as they fail to address a key characteristic of scene text instances: their extremely variable aspect ratio. Instead of resizing input images to a fixed aspect ratio as in the typical use of holistic CNN classifiers, we propose here a patch-based classification framework in order to preserve discriminative parts of the image that are characteristic of its class.We describe a novel method based on the use of ensembles of conjoined networks to jointly learn discriminative stroke-parts representations and their relative importance in a patch-based classification scheme. Our experiments with this learning procedure demonstrate state-of-the-art results in two public script identification datasets.In addition, we propose a new public benchmark dataset for the evaluation of multi-lingual scene text end-to-end reading systems. Experiments done in this dataset demonstrate the key role of script identification in a complete end-to-end system that combines our script identification method with a previously published text detector and an off-the-shelf OCR engine.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving patch-based scene text script identification with ensembles of conjoined networks

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Feb 3, 2017
Citations: 81

Similar Papers

A Fine-Grained Approach to Scene Text Script Identification
Lluis Gomez ... Dimosthenis Karatzas
-
Lluis Gomez, et. al.Lluis Gomez ... Dimosthenis Karatzas
01 Apr 2016
01 Apr 2016

Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network
Ankan Kumar Bhunia ... Umapada Pal
Pattern Recognition | VOL. 85
Ankan Kumar Bhunia, et. al.Ankan Kumar Bhunia ... Umapada Pal
02 Aug 2018
Pattern Recognition | VOL. 85

EA-ConvNeXt: An Approach to Script Identification in Natural Scenes Based on Edge Flow and Coordinate Attention
Zhiyun Zhang ... Alimjan Aysa
Electronics | VOL. 12
Zhiyun Zhang, et. al.Zhiyun Zhang ... Alimjan Aysa
27 Jun 2023
Electronics | VOL. 12

MLTS: A Multi-Language Scene Text Spotter
Yu Zhou ... Hongtao Xie
-
Yu Zhou, et. al.Yu Zhou ... Hongtao Xie
01 Jul 2019
01 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving patch-based scene text script identification with ensembles of conjoined networks

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition