Abstract

Invoice text detection aims to automatically identify key information in invoices, which is one of the representative applications of Optical Character Recognition (OCR) technology and a research hotspot in the computer vision community. The classic traditional OCR method consists of three key steps: preprocessing stage, text detection and text recognition. In the feature extraction stage, it faces problems such as insufficient generalization ability and poor robustness. In this paper, we reveal the great potential of preprocessing strategies in improving OCR accuracy, and propose an invoice text detection method based on DBNet and differentiable binarization thresholds. Specifically, we first introduce color segmentation and local adaptive threshold to improve the preprocessing process, which can effectively suppress the influence of background information and context noise on detection results. In addition, a differentiable binarization threshold is introduced in the feature extraction, which improves the error correction speed in the whole deep learning process. Studies have shown significant improvements in test results after color segmentation or background removal compared to the original image. Therefore, we propose to properly consider the text detection of invoices in the preprocessing stage for improvement.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.