Abstract

This paper presents a novel method for scanning duplex-printed documents without incurring the unwanted show-through artifact. The proposed method achieves the goal of eliminating the leaked-out reverse-side content by fusing a white backed scan image with a black backed scan image of the document. The fusion is accomplished using a multilayer perceptron having learned a fusion mapping from manually corrected document images. The main novel contributions of this work include (1) being the first to propose to accomplish the goal of show through free scanning by fusing a white backed scan image with a black backed scan image of the document; (2) proposing a learning approach using a multilayer perceptron to learn the fusion mapping from manually corrected scan images; and (3) proposing to use the pixel value histogram of reverse-side-printed area as well as the pixel value histogram of duplex-printed area to quantitatively indicate show through severity to facilitate objective comparison of the methods in consideration. The experiment results show that the proposed method is remarkably more powerful in eliminating show through than the two state-of-the-art methods in comparison.

Highlights

  • Document scanning has become an office routine being performed every day and everywhere to capture digital image of document page for convenient storage, copying, transmission, processing, analysis, and recognition etc

  • One major deficiency of the existing scanning methods is that the text and image content on the reverse side of duplex-printed document may show through the paper substrate to appear in the scan image

  • In addition to visual inspection and comparison of the resulting images obtained by the three methods, we propose to use the pixel value histogram of reverse-side-printed area as well as the pixel value histogram of duplex-printed area to quantitatively indicate the severity of the show-through to enable objective comparison

Read more

Summary

Introduction

Document scanning has become an office routine being performed every day and everywhere to capture digital image of document page for convenient storage, copying, transmission, processing, analysis, and recognition etc. This approach brings about the undesirable side effects of showing up the material texture of the paper substrate, and producing black spots if the document contains worn out holes, and giving rise to black borders if the document is smaller than the scan area Another existing way to overcome the show through problem is to scan both the front side and back side of the page, and use the front side image and a flipped and registered version of the reverse side image to achieve the goal [2,3,4,5,6,7,8,9]. The scaling factor μ is used to fit the pixel value range to the input and output ranges of the perceptron, and is set to 0.8/256 for conventional digital image of 8-bitper-channel pixels

Experiments and discussions
Findings
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.