Binarization of Degraded Photographed Document Images- A Variational Denoising Auto Encoder

N Shobha Rani,Srinidhi A,Karthik S K,Bipin Nair B J

doi:10.1109/icirca51532.2021.9544864

Abstract

Document image enhancement is one of the prime researches in the area of optical character recognition and computer vision. Preprocessing procedure for a document depends on document layout, aging and document material type. This paper proposes a preprocessing technique for the enhancement of ancient and degraded document images. Initially the degraded patches from the document image is collected and used for learning through a variational de-noising autoencoder followed by document image enhancement. Ground truth images of the degraded patches are trained with the help of an adamax optimizer. A deep learning architecture comprised of five levels of convolution is devised for encoding and decoding process. Down sampling is initially performed in the encoding stage after each level of convolution. Further up sampling is conducted in the decoding stage. Experimentations are conducted on DIBCO (2016, 2013, 2012, 2011, 2010 and 2009) datasets and the results of enhancement are found to be promising with an average RMSE of 0.106 for batch size 1 and 24 epochs.

Full Text