Learning Deeply Supervised Scene Text Detectors from Scratch

Wei Zhu,Mingwu Ren,Qingyuan Xia

doi:10.1088/1742-6596/1069/1/012008

Wei Zhu, Mingwu Ren + Show 1 more

Open Access

https://doi.org/10.1088/1742-6596/1069/1/012008

Copy DOI

Abstract

In this paper, we propose deeply supervised scene text detector (DSTD), a framework that can be learned from scratch. Our proposed method mainly addresses two problems. The first one is that state-of-the-art text detectors rely heavily on the off-the-shelf pre-trained models, which leads to several limitations including inflexibility and domain mismatch. The second problem is that unlike general objects, scene text usually appear in arbitrary orientations. Text detection using horizontal bounding boxes is inaccurate. In DSTD, we propose to regress rotated rectangles directly from horizontal default boxes to deal with multi-oriented text. Furthermore, we abandon the heavy pre-trained model from the SSD framework and incorporate dense layer-wise connections, enabling the network to be learned from scratch. The proposed method is evaluated on two public datasets, namely ICDAR2013 and ICDAR2015. Experimental results demonstrate its superiority over several state-of-the-art approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Deeply Supervised Scene Text Detectors from Scratch

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Journal: Journal of Physics: Conference Series	Publication Date: Aug 1, 2018
License type: cc-by

Similar Papers

TextField: Learning a Deep Direction Field for Irregular Scene Text Detection.
Yongchao Xu ... Xiang Bai
IEEE Transactions on Image Processing | VOL. 28
Yongchao Xu, et. al.Yongchao Xu ... Xiang Bai
21 Feb 2019
IEEE Transactions on Image Processing | VOL. 28

Scene text detection with fully convolutional neural networks
Zhandong Liu ... Houqiang Li
Multimedia Tools and Applications | VOL. 78
Zhandong Liu, et. al.Zhandong Liu ... Houqiang Li
21 Jan 2019
Multimedia Tools and Applications | VOL. 78

VDetor: An Effective and Efficient Neural Network for Vehicle Detection in Aerial Image
Zhengquan Piao ... Baojun Zhao
-
Zhengquan Piao, et. al.Zhengquan Piao ... Baojun Zhao
01 Dec 2019
01 Dec 2019

IoU-Related Arbitrary Shape Text Scoring Detector
Fagui Liu ... Dian Gu
IEEE Access | VOL. 7
Fagui Liu, et. al.Fagui Liu ... Dian Gu
01 Jan 2019
IEEE Access | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Deeply Supervised Scene Text Detectors from Scratch

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series