Abstract

In this paper, variety of document image enhancement techniques are applied for removal of background noise in the degraded document images. The noise removal techniques are applied on different forms of noise including non-uniform illuminations, complex stain marks, user annotations, show through effect and foxing effect. In this work, Binary Image Analysis (BIA) Technique is proposed for removal of aging degradation in ancient document images of Kannada literature. The method involves multiple phases comprising of contrast enhancement, Gaussian smoothing, binarization, morphological processing, object detection using connected component analysis and filtering followed by marginal noise removal of non-textual regions. The document samples employed for experimentation comprised of more than 175 aged and highly degraded scanned documents of old Kannada literature and poetry that are massively affected by noise and 25 images from DIBCO datasets collected across 2009 to 2017. The results of the experimentation are quite satisfactory and suitable enough for processing of document images in the subsequent stages of OCR..The experimentations are compared with some widely used approaches like Sauvola, Otsu, Gaussian. It is noticed that the proposed method outperforms other noise removal methods in terms of character retentions for extensively degraded and aged document images.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.