Abstract

Since low-resolution images may hamper the performance of optical character recognition (OCR), text image super-resolution (SR) has become an increasingly important problem in computer vision. Convolutional neural network (CNN) has been proposed for generic image SR as well as text image SR, but the previous works concern more on the objective quality (e.g. PSNR) rather than the OCR performance. In this paper, we propose a new loss function when training CNN for text image SR to facilitate OCR, and conduct model combination to further improve the performance. Also, we propose a simple yet effective image padding method to refine the image boundaries during SR. Experimental results show that we achieve an OCR accuracy of 78.10% on the ICDAR 2015 TextSR dataset, which is comparable with that of using the original high-resolution images (78.80%), and also exceeds the state-of-the-arts.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call