Abstract

Digital cameras and mobile image capture of documents are two examples of new developments in the fields of optical character recognition and text recognition. Scans of text or text photographic images and even natural photography results can be distorted to the point where OCR digitization is inaccurate. It offers a unique non-parametric unattended approach to correct unwanted document image distortions to achieve optimal OCR accuracy. It applies a highly effective stack of document image enhancement algorithms to restore perfect images distorted by unknown sources of distortion. First, it provides a means of modifying local brightness and contrast in order to better handle different illumination levels and atypical light transmission patterns in the image. Then apply a nifty grayscale conversion method to your photo to give it a new look. Third, it uses unsharp masking techniques to further enhance important details in grayscale images. Finally, we use the best global binarization technique to prepare the final document image for OCR recognition. The proposed technique has the potential to significantly improve the text recognition rate and accuracy of optical character recognition.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call