Residual attention-based multi-scale script identification in scene text images

Mengkai Ma, Qiu-Feng Wang, Shan Huang, Shen Huang, Yannis Goulermas, Kaizhu Huang

doi:10.1016/j.neucom.2020.09.015

Abstract

Script identification is an essential step in the text extraction pipeline for multi-lingual applications. This paper presents an effective approach to identifying scripts in scene text images. Because of complicated backgrounds, varied text styles, and the similarity of characters across languages, script identification remains an unsolved problem. Within the general classification framework for script identification, we investigate two important components: feature extraction and the classification layer. For feature extraction, we use a hierarchical feature fusion block to extract multi-scale features, and we adopt an attention mechanism to capture the locally discriminative parts of the feature maps. In the classification layer, we use a fully convolutional classifier to generate channel-level classifications, which are then processed by a global pooling layer to improve classification efficiency. We evaluated the proposed approach on the benchmark datasets RRC-MLT2017, SIW-13, CVSI-2015 and MLe2e, and the experimental results show the effectiveness of each elaborately designed component. Finally, we achieve better performance than competing models, with correct rates of 89.66%, 96.11%, 98.78% and 97.20% on RRC-MLT2017, SIW-13, CVSI-2015 and MLe2e, respectively.
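The classification layer described in the abstract can be illustrated with a minimal sketch. The abstract does not give layer details, so the following are assumptions made only for illustration: the classifier is modelled as a single 1x1 convolution that turns C feature channels into one score map per script class, the global pooling is average pooling, and all names (`conv1x1`, `global_average_pool`, `classify`) and toy values are hypothetical. It is written in plain Python rather than a deep learning framework so that it runs on its own.

```python
import math


def conv1x1(features, weights, biases):
    """Map features[C][H][W] to class_maps[K][H][W] with a 1x1 convolution.

    Assumption: a single 1x1 convolution stands in for the paper's
    fully convolutional classifier, whose exact layers are not given.
    """
    channels = len(features)
    height, width = len(features[0]), len(features[0][0])
    class_maps = []
    for k, row in enumerate(weights):
        class_map = [
            [
                biases[k] + sum(row[c] * features[c][y][x] for c in range(channels))
                for x in range(width)
            ]
            for y in range(height)
        ]
        class_maps.append(class_map)
    return class_maps


def global_average_pool(class_maps):
    """Average each class map over all spatial positions (assumed pooling type)."""
    return [
        sum(sum(row) for row in class_map) / (len(class_map) * len(class_map[0]))
        for class_map in class_maps
    ]


def softmax(scores):
    """Numerically stable softmax over a list of class scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features, weights, biases):
    """Per-position class scores, pooled to one score per class, then normalised."""
    return softmax(global_average_pool(conv1x1(features, weights, biases)))


# Toy input: C=2 feature channels, H=2, W=3, and K=3 hypothetical script classes.
features = [
    [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0]],
    [[0.0, 1.0, 0.0], [1.0, 0.0, 1.0]],
]
weights = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
biases = [0.0, 0.0, 0.0]

probs = classify(features, weights, biases)
predicted_class = probs.index(max(probs))
```

In this sketch the classifier has no fully connected layer, so the number of outputs depends only on the number of classes and not on the width of the input feature map. That property is a plausible reason for the design when text images vary in length, though the abstract itself only claims improved classification efficiency.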

Journal: Neurocomputing	Publication Date: Sep 28, 2020
Citations: 23

Similar Papers

Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network
Ankan Kumar Bhunia ... Umapada Pal
Pattern Recognition | VOL. 85
02 Aug 2018

Improving patch-based scene text script identification with ensembles of conjoined networks
Lluis Gomez ... Dimosthenis Karatzas
Pattern Recognition | VOL. 67
03 Feb 2017

MLTS: A Multi-Language Scene Text Spotter
Yu Zhou ... Hongtao Xie
01 Jul 2019

A New Method for Arabic Text Detection in Natural Scene Images
Houda Gaddour ... Slim Kanoun
International Journal of Image and Graphics | VOL. 23
17 Dec 2021
