Convolutional Regression Network for Multi-Oriented Text Detection

Junyu Gao,Qi Wang,Yuan Yuan

doi:10.1109/access.2019.2929819

Abstract

Multi-oriented text detection in the wild is a challenging task due to the variations of scales, orientations, illumination, and languages. The traditional anchor mechanism on generic object detection can only generate horizontal proposals, which cannot be applied to detecting multi-oriented text regions. Considering this, in this paper, we propose a novel convolutional regression network (CRN) to localize multi-oriented text in natural images, which consists of two components: region proposal extractor and text locator. To be specific, we first present a hierarchical deconvolution module (HDM), a text-line and geometry segmentation module (TGM) to segment the multi-oriented proposals accurately, both of which are fully convolutional networks. Then, a classification and regression module (CRM) is adopted to process the proposals and obtain the final localization results. The whole framework can be trained in an end-to-end mechanism which is suitable for detecting multi-oriented texts. The extensive experiments are conducted on three mainstream scene-text datasets, and the experimental results evidence the proposed CRN achieves competitive performance.

Highlights

Reading text from the natural images has attracted much attention in the field of computer vision because of its numerous applications, such as image retrieval [1]–[4], robot navigation [5], [6], video analysis [7]–[9] and scene understanding [10]–[14]
We propose a single framework combining segmentation and detection in an end-to-end manner, which is named as Convolutional Regression Network (CRN)
We develop two modules to handle multi-oriented text, namely Hierarchical Deconvolution Module (HDM), Text-line and Geometry segmentation Module (TGM)

Summary

Introduction

Reading text from the natural images has attracted much attention in the field of computer vision because of its numerous applications, such as image retrieval [1]–[4], robot navigation [5], [6], video analysis [7]–[9] and scene understanding [10]–[14]. Accurate text localization/ detection [15]–[18] is a prerequisite for effectively understanding text. We focus on complicated multi-oriented text detection task. Previous works related to text detection usually contain many sequential steps, including character detection, character classification, text line construction and word splitting. These multi-step approaches are complicated and the error may be accumulated with the increase of steps. Many methods [19], [20] based on generic object detection

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 8	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Convolutional Regression Network for Multi-Oriented Text Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Detecting of Vertically-Oriented Texts in Images Containing Natural Scenes
Yi Ling Ong ... Bee Theng Lau
-
Yi Ling Ong, et. al.Yi Ling Ong ... Bee Theng Lau
07 Dec 2020
07 Dec 2020

Video frames text detection through Bayesian classification and boundary growing method
A Nancy ... D Jayapriya
-
A Nancy, et. al.A Nancy ... D Jayapriya
01 Feb 2014
01 Feb 2014

Multi-oriented text detection from natural scene images based on a CNN and pruning non-adjacent graph edges
Yuanwang Wei ... Zhijiang Zhang
Signal Processing: Image Communication | VOL. 64
Yuanwang Wei, et. al.Yuanwang Wei ... Zhijiang Zhang
08 Mar 2018
Signal Processing: Image Communication | VOL. 64

A Hybrid Deep Neural Network for Urdu Text Recognition in Natural Images
Asghar Ali ... Mark Pickering
-
Asghar Ali, et. al.Asghar Ali ... Mark Pickering
01 Jul 2019
01 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Convolutional Regression Network for Multi-Oriented Text Detection

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access