Resolution Enhancement for Low-resolution Text Images Using Generative Adversarial Network

Jie Kong,Congying Wang,X Lei,H Li,Y Yu,S Cai

doi:10.1051/matecconf/201824603040

Abstract

In recent years, although Optical Character Recognition (OCR) has made considerable progress, low-resolution text images commonly appearing in many scenarios may still cause errors in recognition. For this problem, the technique of Generative Adversarial Network in super-resolution processing is applied to enhance the resolution of low-quality text images in this study. The principle and the implementation in TensorFlow of this technique are introduced. On this basis, a system is proposed to perform the resolution enhancement and OCR for low-resolution text images. The experimental results indicate that this technique could significantly improve the accuracy, reduce the error rate and false rejection rate of low-resolution text images identification.

Highlights

In recent years, Optical Character Recognition (OCR) has been widely applied in the information input of data records on the printed paper
Text images generated many years ago may be limited by sampling devices and encoding algorithms, resulting in low-resolution, text in photos and videos may result in low-resolution after clipping and enlargement, which make the traditional OCR recognition technology unable to fulfil the corresponding requirements
An emerging machine learning technique: generative adversarial network (GAN) is adopted to build a super-resolution processing system to improve the performance of OCR recognition

Summary

Introduction

OCR has been widely applied in the information input of data records on the printed paper. OCR is the process of converting text image, such as the text on the handwriting document, printed document, scanned document, etc., to the machine-encoded text [1]. Some OCR recognition systems may produce errors in recognizing low-resolution text images This is because low-resolution text images lack highfrequency image details, which makes it difficult for OCR systems to retrieve text information correctly. This problem exists widely in practical applications. A solution for this problem is to perform superresolution processing on low-resolution text images, so as to achieve accurate recognition for the OCR [2]. An emerging machine learning technique: generative adversarial network (GAN) is adopted to build a super-resolution processing system to improve the performance of OCR recognition. The performance of proposed system is evaluated by test datasets

Related works

The mathematical model of GAN

The convolution layers

The activation function

The B Residual Block

The implementation of Discriminator

The content loss function

The adversarial loss function

The discriminator loss function

The architecture of resolution enhancement and OCR system for text image

The training of GAN

Findings

Conclusion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Resolution Enhancement for Low-resolution Text Images Using Generative Adversarial Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: MATEC Web of Conferences

Lead the way for us

Journal: MATEC Web of Conferences	Publication Date: Jan 1, 2018
License type: CC BY 4.0

Similar Papers

Perceiving Multiple Representations for scene text image super-resolution guided by text recognizer
Qin Shi ... Dawei Yang
Engineering Applications of Artificial Intelligence | VOL. 124
Qin Shi, et. al.Qin Shi ... Dawei Yang
07 Jun 2023
Engineering Applications of Artificial Intelligence | VOL. 124

Improving Text Image Resolution using a Deep Generative Adversarial Network for Optical Character Recognition
Xiangdong Su ... Xiang Hao
-
Xiangdong Su, et. al.Xiangdong Su ... Xiang Hao
01 Sep 2019
01 Sep 2019

Scene Text Image Super-Resolution via Parallelly Contextual Attention Network
Cairong Zhao ... Heng Tao Shen
-
Cairong Zhao, et. al.Cairong Zhao ... Heng Tao Shen
17 Oct 2021
17 Oct 2021

MRI super-resolution via realistic downsampling with adversarial learning
Bangyan Huang ... Jing Cai
Physics in Medicine and Biology | VOL. 66
Bangyan Huang, et. al.Bangyan Huang ... Jing Cai
05 Oct 2021
Physics in Medicine and Biology | VOL. 66

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Resolution Enhancement for Low-resolution Text Images Using Generative Adversarial Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: MATEC Web of Conferences