R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection

Yuxin Wang,Hongtao Xie,Youliang Tian,Yongdong Zhang,Zilong Fu,Zhengjun Zha

doi:10.1109/tmm.2020.2995290

Abstract

This paper introduces a novel bi-directional con-volutional framework to cope with the large-variance scale problem in scene text detection. Due to the lack of scale normalization in recent CNN-based methods, text instances with large-variance scale are activated inconsistently in feature maps, which makes it hard for CNN-based methods to accurately locate multi-size text instances. Thus, we propose the relationship network (R-Net) that maps multi-scale convolutional features to a scale-invariant space to obtain consistent activation of multi-size text instances. Firstly, we implement an FPN-like backbone with a Spatial Relationship Module (SPM) to extract multi-scale features with powerful spatial semantics. Then, a Scale Relationship Module (SRM) constructed on feature pyramid propagates contextual scale information in sequential features through a bi-directional convolutional operation. SRM supplements the multi-scale information in different feature maps to obtain consistent activation of multi-size text instances. Compared with previous approaches, R-Net effectively handles the large-variance scale problem without complicated post processing and complex hand-crafted hyperparameter setting. Extensive experiments conducted on several benchmarks verify that our R-Net obtains state-of-the-art performance on both accuracy and efficiency. More specifically, R-Net achieves an F-measure of 85.6% at 21.4 frames/s and an F-measure of 81.7% at 11.8 frames/s for ICDAR 2015 and MSRA-TD500 datasets respectively, which is the latest SOTA. The code is available on https://github.com/wangyuxin87/R-Net .

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia

Lead the way for us

Journal: IEEE Transactions on Multimedia	Publication Date: May 21, 2020
Citations: 87

Similar Papers

DSRN: A Deep Scale Relationship Network for Scene Text Detection
Yuxin Wang ... Yongdong Zhang
-
Yuxin Wang, et. al.Yuxin Wang ... Yongdong Zhang
01 Aug 2019
01 Aug 2019

A Multi-Scale Natural Scene Text Detection Method Based on Attention Feature Extraction and Cascade Feature Fusion.
Nianfeng Li ... Zhenyan Wang
Sensors (Basel, Switzerland) | VOL. 24
Nianfeng Li, et. al.Nianfeng Li ... Zhenyan Wang
09 Jun 2024
Sensors (Basel, Switzerland) | VOL. 24

DPNet: Scene text detection based on dual perspective CNN-transformer.
Yuan Li
PloS one | VOL. 19
Yuan LiYuan Li
21 Oct 2024
PloS one | VOL. 19

Text Attention and Focal Negative Loss for Scene Text Detection
Randong Huang ... Bo Xu
-
Randong Huang, et. al.Randong Huang ... Bo Xu
01 Jul 2019
01 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

R-Net: A Relationship Network for Efficient and Accurate Scene Text Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia