Abstract

Text recognition in natural scenes has been a very challenging task in recent years, and rich text semantic information is of great significance for the understanding of a scene. However, text images in natural scenes often contain a lot of noise data, which leads to error detection. The problems of high error detection rate and low recognition accuracy have brought great challenges to the task of text recognition. To solve this problem, we propose a text recognition algorithm based on natural scenes. First, the task of text detection and recognition is completed in an end-to-end way in a framework, which can reduce the cumulative error prediction and calculation caused by cascading, and has higher real-time and faster speed. In addition, we integrate a multi-scale attention mechanism to obtain attention features of different scale feature maps. Finally, we use the efficient deep learning network (EE-ACNN), which combines a convolutional neural network (CNN) with an end-to-end algorithm and multi-scale attention to enrich the text features to be detected, expands its receptive field, produces good robustness to the effective natural text information, and improves the recognition performance. Through experiments on text data sets of natural scenes, the accuracy of this method reached 93.87%, which is nearly 0.96–1.02% higher than that of traditional methods, and which proves the feasibility of this method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call