A simple and effective table detection system from document images

S Mandal,S P Chowdhury,A K Das,Bhabatosh Chanda

doi:10.1007/s10032-005-0006-5

A simple and effective table detection system from document images

S Mandal, S P Chowdhury + Show 2 more

https://doi.org/10.1007/s10032-005-0006-5

Copy DOI

Journal: International Journal on Document Analysis and Recognition (IJDAR)	Publication Date: Mar 24, 2006
Citations: 78

Affiliation: Indian Institute of Engineering Science and Technology, Shibpur, Indian Statistical Institute

#Identification Of Tables #Simple Detection System + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The requirement of detection and identification of tables from document images is crucial to any document image analysis and digital library system. In this paper we report a very simple but extremely powerful approach to detect tables present in document pages. The algorithm relies on the observation that the tables have distinct columns which implies that gaps between the fields are substantially larger than the gaps between the words in text lines. This deceptively simple observation has led to the design of a simple but powerful table detection system with low computation cost. Moreover, mathematical foundation of the approach is also established including formation of a regular expression for ease of implementation.

Full Text