T²Net: an improved image-based text transfer framework using background inpainting and text conversion

Haibin Zhou, Haijun Zhang, Boxiang Jia, Lujiao Shao

doi:10.1007/s44244-023-00010-6

Open Access

Abstract

Text, regarded as one of the important clues for visual recognition, can provide rich and accurate high-level semantic information. The detection and recognition of textual data have therefore become a research hotspot in computer vision and artificial intelligence. However, the difficulty of data collection and the non-uniform distribution of characters still pose challenges for accurate text recognition, especially for complicated character sets such as Chinese. To address small-sample text recognition, we propose an improved image-based text transfer framework, named T²Net. The framework can replace or modify the text content in an image, so that a recognition data set can be expanded arbitrarily. Because the main challenge of text transfer lies in decoupling the complex interrelationship between text and background, a text content mask branch is first added to the background inpainting module to restore background textures more realistically. Second, a text recognition model is developed to guide the readability of the text transfer results in the text conversion module. Finally, a text fusion module fuses the independently transferred background and text. We evaluated the proposed framework on a real-world scene text recognition data set. Qualitative and quantitative results demonstrate the efficiency of our method in comparison with previous works.
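The three stages named in the abstract (background inpainting, text conversion, text fusion) can be read as a simple data flow. The sketch below is a toy illustration of that flow only, not the authors' implementation: in the paper each module is a learned network, whereas here each stage is replaced by a hand-written stand-in operating on a tiny grayscale grid. All function names and the mean-fill and flat-intensity rules are assumptions introduced for illustration.

```python
from typing import List

Image = List[List[float]]  # grayscale image, values in [0, 1]
Mask = List[List[int]]     # 1 where text pixels are, 0 elsewhere


def inpaint_background(image: Image, source_mask: Mask) -> Image:
    """Toy stand-in for background inpainting (assumption: mean fill).

    Pixels covered by the source text mask are replaced with the mean
    of the unmasked pixels, erasing the original text.
    """
    known = [p for row, mrow in zip(image, source_mask)
             for p, m in zip(row, mrow) if m == 0]
    fill = sum(known) / len(known) if known else 0.0
    return [[fill if m else p for p, m in zip(row, mrow)]
            for row, mrow in zip(image, source_mask)]


def convert_text(target_mask: Mask, text_value: float) -> Image:
    """Toy stand-in for text conversion (assumption: flat intensity).

    Renders the target text as a foreground layer with a single
    intensity wherever the target mask is set.
    """
    return [[text_value * m for m in row] for row in target_mask]


def fuse(background: Image, foreground: Image, target_mask: Mask) -> Image:
    """Toy stand-in for text fusion: mask-weighted composite."""
    return [[m * f + (1 - m) * b for b, f, m in zip(brow, frow, mrow)]
            for brow, frow, mrow in zip(background, foreground, target_mask)]


def transfer_text(image: Image, source_mask: Mask,
                  target_mask: Mask, text_value: float) -> Image:
    """Run the three stages in order: inpaint, convert, fuse."""
    background = inpaint_background(image, source_mask)
    foreground = convert_text(target_mask, text_value)
    return fuse(background, foreground, target_mask)


# A 2x3 image with one bright "text" pixel on a mid-gray background.
image = [[0.5, 1.0, 0.5],
         [0.5, 0.5, 0.5]]
source_mask = [[0, 1, 0],
               [0, 0, 0]]
# The new "text" is a dark pixel placed one row lower.
target_mask = [[0, 0, 0],
               [0, 1, 0]]

result = transfer_text(image, source_mask, target_mask, text_value=0.0)
```

In this toy run the original bright pixel is erased by the inpainting step and a dark pixel appears at the target location. Keeping the background and text paths separate until the final composite mirrors the decoupling the abstract describes.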

Journal: Industrial Artificial Intelligence
Publication Date: Jul 11, 2023
License type: CC BY 4.0

Similar Papers

A survey of text detection and recognition algorithms based on deep learning technology
Xiao-Feng Wang ... Zhi-Ze Wu
Neurocomputing | VOL. 556
18 Aug 2023

Vietnamese Scene Text Detection and Recognition using Deep Learning: An Empirical Study
Nhat Truong Pham ... Qui Nguyen-Van
29 Jul 2022

Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images.
Asghar Ali Chandio ... Mehwish Leghari
Data in Brief | VOL. 31
21 May 2020

An end-to-end model for multi-view scene text recognition
Ayan Banerjee ... Cheng-Lin Liu
Pattern Recognition | VOL. 149
17 Dec 2023
