TSER: A Two-Stage Character Segmentation Network With Two-Stream Attention and Edge Refinement

Jinyingming Zhang,Jin Liu,Xiongwei Xu,Mingyang Duan,Peizhu Gong

doi:10.1109/access.2020.3036545

Abstract

Segmenting characters in an image is a classic yet challenging task in computer vision. Correctly determining the boundaries of adhesive characters with various scales and shapes is essential for character segmentation, especially for separating handwritten characters. Nevertheless, few works in the literature achieve satisfactory performance. In this article, by leveraging the ability of deep neural networks, we propose a two-stage character segmentation network with two-stream attention and edge refinement (TSER) to tackle this problem. TSER first locates every character by object detection, then extracts the corresponding contours. In the process, a novel two-stream attention mechanism (TSAM) is proposed to make the network focus more on the discrepancy of character boundaries. Furthermore, a novel anchor generation method is used to dynamically generate anchors on different feature levels to improve the model's sensitivity to the shapes and scales of characters. Finally, a cascaded edge refinement network is used to obtain the contour of each character. To demonstrate the efficiency and generalization ability of our model, we compared TSER with traditional algorithms and other deep learning models on two commonly used datasets in different segmentation tasks. The comparative results indicate that TSER reached state-of-the-art performance.

Highlights

Character segmentation is a vital step in the traditional optical text recognition process
We propose a novel two-stage character segmentation network with two-stream attention and edge refinement (TSER) that aims at segmenting characters from text lines in images
TSER has four major contributions: 1) A novel two-stage segmentation network focusing on the character segmentation task is proposed

Summary

INTRODUCTION

Character segmentation is a vital step in the traditional optical text recognition process. We propose a two-stage character segmentation network that can accurately segment characters from text line images with random noise under various conditions: normal spacing, subtle spacing, adhesive characters, partially overlapping characters, characters with deflection angles, and characters with different scales and shapes. A two-stream attention mechanism is proposed to guide the feature selection process of the model. This attention mechanism helps distinguish the boundaries of adhesive characters and helps locate every character instance. A guided anchoring method is applied to produce sparse and appropriate anchors instead of the dense and redundant ones in a traditional region proposal network. This module can reduce computational cost and generate anchors that fit various character shapes and scales.
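The contrast between dense and guided anchors can be illustrated with a minimal sketch. This is not the TSER implementation: it assumes a simplified form of guided anchoring in which a hypothetical location-probability map (`loc_prob`) and a hypothetical per-location shape map (`shape_pred`, giving width and height directly) are already available, and it only keeps one anchor at each location whose probability passes a threshold. The function names, the stride, the threshold, and the toy values are all assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Anchor = Tuple[float, float, float, float]  # (cx, cy, w, h) in image pixels


def dense_anchors(feat_h: int, feat_w: int, stride: int,
                  scales: Sequence[float],
                  ratios: Sequence[float]) -> List[Anchor]:
    """Classic RPN-style tiling: every scale/ratio pair at every location."""
    anchors = []
    for y in range(feat_h):
        for x in range(feat_w):
            cx, cy = (x + 0.5) * stride, (y + 0.5) * stride
            for s in scales:
                for r in ratios:
                    # r is width/height; the anchor area stays s * s.
                    anchors.append((cx, cy, s * r ** 0.5, s / r ** 0.5))
    return anchors


def guided_anchors(loc_prob: Sequence[Sequence[float]],
                   shape_pred: Sequence[Sequence[Tuple[float, float]]],
                   stride: int, threshold: float = 0.5) -> List[Anchor]:
    """Simplified guided anchoring: one anchor per confident location,
    using the (assumed) predicted width and height at that location."""
    anchors = []
    for y, row in enumerate(loc_prob):
        for x, p in enumerate(row):
            if p >= threshold:
                w, h = shape_pred[y][x]
                anchors.append(((x + 0.5) * stride, (y + 0.5) * stride, w, h))
    return anchors


# Toy 2x4 feature map with made-up predictions (illustrative values only).
loc_prob = [[0.1, 0.9, 0.2, 0.8],
            [0.0, 0.3, 0.1, 0.1]]
shape_pred = [[(8.0, 8.0), (10.0, 24.0), (8.0, 8.0), (14.0, 22.0)],
              [(8.0, 8.0), (8.0, 8.0), (8.0, 8.0), (8.0, 8.0)]]

dense = dense_anchors(2, 4, stride=8, scales=(16, 32, 64), ratios=(0.5, 1.0, 2.0))
sparse = guided_anchors(loc_prob, shape_pred, stride=8)
print(len(dense), len(sparse))
```

On this toy input the dense tiling produces an anchor for every scale and ratio at every cell, while the guided variant keeps only the two confident locations, each with a tall, narrow shape of the kind a character might need. In practice the location and shape maps would be predicted by network branches on each feature level rather than supplied by hand.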

RELATED WORK

PROBLEM DEFINITION

OVERALL NETWORK ARCHITECTURE

Calculating area of Bc

OTHER REMEDIES

MODEL TRAINING

EXPERIMENTS

Method

ABLATION STUDY

TIME COST Training

CONCLUSIONS

Journal: IEEE Access: Practical Innovations, Open Solutions
Publication Date: Jan 1, 2020
Citations: 39
License type: CC BY 4.0
