HU‐PageScan: a fully convolutional neural network for document page crop

Ricardo Batista Das Neves,Byron L.D Bezerra,Estanislau Lima,Cleber Zanchettin,Alejandro H Toselli

doi:10.1049/iet-ipr.2020.0532

Abstract

November The offer of online, automated, and impersonal services demand users to upload scanned copies of their documents to the organisations. As a consequence of this decentralisation, the documents present more challenges to the already complex process of image processing and information extraction. To address this problem, the authors presented an optimised fully convolutional neural network model for document segmentation that works on mobile devices to detect the region of the document in the captured image. They performed experiments in three representative datasets comparing the proposed method with the Geodesic object Proposals, U-net, Mask R-CNN, and OctHU-PageScan algorithms. They also compared the proposed model with all competitors of the ICDAR2015 Competition on smartphone document capture. Furthermore, they performed a qualitative and comparative analysis with the CamScanner software, a popular app for Android and iOS smartphones used for more than 100 million users in over 200 countries. The proposed approach achieved a significant performance compared with the current state-of-the-art methods, providing a powerful approach for document segmentation in photos and scanned images.

Full Text