(HTBNet)Arbitrary Shape Scene Text Detection with Binarization of Hyperbolic Tangent and Cross-Entropy.

Zhao Chen

doi:10.3390/e26070560

Abstract

The existing segmentation-based scene text detection methods mostly need complicated post-processing, and the post-processing operation is separated from the training process, which greatly reduces the detection performance. The previous method, DBNet, successfully simplified post-processing and integrated post-processing into a segmentation network. However, the training process of the model took a long time for 1200 epochs and the sensitivity to texts of various scales was lacking, leading to some text instances being missed. Considering the above two problems, we design the text detection Network with Binarization of Hyperbolic Tangent (HTBNet). First of all, we propose the Binarization of Hyperbolic Tangent (HTB), optimized along with which the segmentation network can expedite the initial convergent speed by reducing the number of epochs from 1200 to 600. Because features of different channels in the same scale feature map focus on the information of different regions in the image, to better represent the important features of all objects in the image, we devise the Multi-Scale Channel Attention (MSCA). Meanwhile, considering that multi-scale objects in the image cannot be simultaneously detected, we propose a novel module named Fused Module with Channel and Spatial (FMCS), which can fuse the multi-scale feature maps from channel and spatial dimensions. Finally, we adopt cross-entropy as the loss function, which measures the difference between predicted values and ground truths. The experimental results show that HTBNet, compared with lightweight models, has achieved competitive performance and speed on Total-Text (F-measure:86.0%, FPS:30) and MSRA-TD500 (F-measure:87.5%, FPS:30).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

(HTBNet)Arbitrary Shape Scene Text Detection with Binarization of Hyperbolic Tangent and Cross-Entropy.

Abstract

Talk to us

Similar Papers

More From: Entropy (Basel, Switzerland)

Lead the way for us

Journal: Entropy (Basel, Switzerland)	Publication Date: Jun 29, 2024
License type: CC BY 4.0

Similar Papers

Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion.
Minghui Liao ... Xiang Bai
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Minghui Liao, et. al.Minghui Liao ... Xiang Bai
01 Jan 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

A Robust Method: Arbitrary Shape Text Detection Combining Semantic and Position Information
Zhenchao Wang ... Yuze Li
Sensors | VOL. 22
Zhenchao Wang, et. al.Zhenchao Wang ... Yuze Li
18 Dec 2022
Sensors | VOL. 22

Arbitrary Scene Text Detection with Bezier Proposal
Yuanyu Chen ... Yihong Li
-
Yuanyu Chen, et. al.Yuanyu Chen ... Yihong Li
29 Oct 2021
29 Oct 2021

Scene text detection and recognition with advances in deep learning: a survey
Xiyan Liu ... Chunhong Pan
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 22
Xiyan Liu, et. al.Xiyan Liu ... Chunhong Pan
27 Mar 2019
International Journal on Document Analysis and Recognition (IJDAR) | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

(HTBNet)Arbitrary Shape Scene Text Detection with Binarization of Hyperbolic Tangent and Cross-Entropy.

Abstract

Talk to us

Similar Papers

More From: Entropy (Basel, Switzerland)