Abstract

This paper proposes DiZNet, an efficient and novel end-to-end text detection and recognition framework. DiZNet is built upon a core representation using text detail maps and employs the lightweight ResNet18 as the backbone of the text detection and recognition model. The redesigned Text Attention Head (TAH) takes multiple shallow backbone features as input and effectively extracts pixel-level text information and global text position features. The extracted text features are integrated into the stackable Feature Pyramid Enhancement Fusion Module (FPEFM). Supervised with text detail map labels, which encode the boundary and texture information of salient text, the model predicts text detail maps and fuses them into the text detection and recognition heads. In end-to-end evaluation on publicly available natural scene text benchmarks, our approach demonstrates robust generalization and real-time detection speed. Leveraging the advantages of the text detail map representation, DiZNet achieves a good balance between accuracy and efficiency on challenging datasets. For example, DiZNet achieves 91.2% precision and an 85.9% F-measure at 38.4 FPS on Total-Text, and an 83.8% F-measure at 30.0 FPS on ICDAR2015. The code is publicly available at: https://github.com/DiZ-gogogo/DiZNet
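
To make the described data flow concrete, the following is a minimal sketch of how a ResNet18 backbone, a Text Attention Head, and stackable FPEFM blocks could be wired together. The module names follow the abstract, but their internal structure, channel sizes, and the fusion of the detail map with the features are assumptions for illustration, not the authors' implementation.

```python
# Hypothetical sketch of the DiZNet data flow described in the abstract.
# Internal layer choices are assumptions; only the module names come from the paper.
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import resnet18


class TextAttentionHead(nn.Module):
    """Assumed form: fuse shallow backbone features into a pixel-level
    text detail map carrying boundary and texture cues."""
    def __init__(self, in_channels=(64, 128), out_channels=64):
        super().__init__()
        self.reduce = nn.ModuleList(
            [nn.Conv2d(c, out_channels, 1) for c in in_channels])
        self.detail = nn.Conv2d(out_channels, 1, 3, padding=1)

    def forward(self, feats):
        base = feats[0]
        fused = sum(
            F.interpolate(r(f), size=base.shape[2:], mode="bilinear",
                          align_corners=False)
            for r, f in zip(self.reduce, feats))
        return torch.sigmoid(self.detail(fused)), fused


class FPEFM(nn.Module):
    """Assumed stackable enhancement block: a residual 3x3 conv refinement."""
    def __init__(self, channels=64):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels), nn.ReLU(inplace=True))

    def forward(self, x):
        return x + self.block(x)


class DiZNetSketch(nn.Module):
    def __init__(self, num_fpefm=2):
        super().__init__()
        r = resnet18(weights=None)
        # Shallow ResNet18 stages provide the multi-scale features fed to TAH.
        self.stem = nn.Sequential(r.conv1, r.bn1, r.relu, r.maxpool)
        self.layer1, self.layer2 = r.layer1, r.layer2
        self.tah = TextAttentionHead()
        self.fpefm = nn.Sequential(*[FPEFM() for _ in range(num_fpefm)])
        self.det_head = nn.Conv2d(64, 2, 1)  # e.g. text-region and kernel maps

    def forward(self, x):
        c1 = self.layer1(self.stem(x))
        c2 = self.layer2(c1)
        detail_map, fused = self.tah([c1, c2])
        # Assumed fusion: re-weight features with the predicted detail map.
        enhanced = self.fpefm(fused + fused * detail_map)
        return detail_map, self.det_head(enhanced)


if __name__ == "__main__":
    model = DiZNetSketch()
    detail, det = model(torch.randn(1, 3, 640, 640))
    print(detail.shape, det.shape)  # (1, 1, 160, 160), (1, 2, 160, 160)
```

In this sketch the predicted detail map is supervised against text detail map labels and simultaneously gates the features passed to the detection head, mirroring the fusion described in the abstract; the recognition branch and training losses are omitted.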
