Abstract

Data digitalization is the need of the present time as it makes data storage and processing quite convenient and fast. Presently, Libraries around the world are having a lot of precious knowledge in printed form, which needs to be converted in the digital domain for easy processing. An appropriate mechanical model along with opti-cal character recognition (OCR) ability could be a possible solution to this. In this work, a low cost automatic document scan machine has been proposed, which uses a raspberry pi module to turn the pages and captures the images containing text with the help of a camera. Generally, scanners available in the market are quite costly and lim-ited to scan the same page size. An automatic approach to scan varying length pages and GSMs (Grams per square meter) with an advantage of low cost can be very handy. The flexibility of changing positions of the sensor, camera, and motors helps the pro-posed device to do so. Image pre-processing is performed before feeding the scanned images into tesseract OCR to improve the accuracy of recognition.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.