Abstract

Optical character recognition (OCR) technology is promoting the process of automation in daily information recording and inspection of industrial meters. However, bad installation scenes and long-time running cause industrial meters to age to varying degrees, which increases the difficulty of OCR. Besides, in practice, operators typically use handheld cameras to extract tag information from industrial meters. During the shooting process, extreme lighting, free shooting angles, and shooting distance also cause many difficulties for OCR. Considering such difficulties, an end-to-end recognition architecture is developed to obtain a better OCR performance. The proposed architecture can quickly extract structured information from normal or skewed text images. A novel yolov5_adaloss with a specific penalty factor is designed to alleviate the influence of illumination, installation scene, age, skew, and distance on the classification accuracy of Tags. For images with large skew angles, a mathematical method is proposed to calculate the text inclination angle for better OCR performance. The contribution of this work is twofold. Firstly, the architecture proposed is lightweight, which only needs a low computing cost and a short inference time. Secondly, as a practical application-oriented architecture, this work does not require much training, labeling, and fine-tuning, which is easy to generalize to other structured text recognition tasks. Experiments show that the method proposed in this paper can achieve excellent performance in meter tag recognition tasks on actual industrial images of meters.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call