Looking from a Higher-Level Perspective: Attention and Recognition Enhanced Multi-scale Scene Text Segmentation

Yujin Ren,Xiaoyi Zhang,Jiaxin Zhang,Lianwen Jin,Bangdong Chen

doi:10.1007/978-3-031-26293-7_38

Abstract

Scene text segmentation, which aims to generate pixel-level text masks, is an integral part of many fine-grained text tasks, such as text editing and text removal. Multi-scale irregular scene texts are often trapped in complex background noise around the image, and their textures are diverse and sometimes even similar to those of the background. These specific problems bring challenges that make general segmentation methods ineffective in the context of scene text. To tackle the aforementioned issues, we propose a new scene text segmentation pipeline called Attention and Recognition enhanced Multi-scale segmentation Network (ARM-Net), which consists of three main components: Text Segmentation Module (TSM) generates rectangular receptive fields of various sizes to fit scene text and integrate global information adequately; Dual Perceptual Decoder (DPD) strengthens the connection between pixels that belong to the same category from the spatial and channel perspective simultaneously during upsampling, and Recognition Enhanced Module (REM) provides text attention maps as a prior for the segmentation network, which can inherently distinguish text from background noise. Via extensive experiments, we demonstrate the effectiveness of each module of ARM-Net, and its performance surpasses that of existing state-of-the-art scene text segmentation methods. We also show that the pixel-level mask produced by our method can further improve the performance of text removal and scene text recognition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Looking from a Higher-Level Perspective: Attention and Recognition Enhanced Multi-scale Scene Text Segmentation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Scene Text Segmentation with Multi-level Maximally Stable Extremal Regions
Shangxuan Tian ... Bolan Su
-
Shangxuan Tian, et. al.Shangxuan Tian ... Bolan Su
01 Aug 2014
01 Aug 2014

Occluded Text Detection and Recognition in the Wild
Zobeir Raisi ... John Zelek
-
Zobeir Raisi, et. al.Zobeir Raisi ... John Zelek
01 May 2022
01 May 2022

STV2k
Pingping Xiao ... Da-Han Wang
-
Pingping Xiao, et. al.Pingping Xiao ... Da-Han Wang
19 Aug 2016
19 Aug 2016

Dictionary-guided Scene Text Recognition
Nguyen Nguyen ... Vinh Tran
-
Nguyen Nguyen, et. al.Nguyen Nguyen ... Vinh Tran
01 Jun 2021
01 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Looking from a Higher-Level Perspective: Attention and Recognition Enhanced Multi-scale Scene Text Segmentation

Abstract

Talk to us

Similar Papers