Detection and Segmentation of Table of Contents and Index Pages from Document Images

S Mandal,A.K Das,B Chanda,S.P Chowdhury

doi:10.1109/dial.2006.13

Detection and Segmentation of Table of Contents and Index Pages from Document Images

S Mandal, A.K Das + Show 2 more

https://doi.org/10.1109/dial.2006.13

Copy DOI

Publication Date: Apr 27, 2006

Citations: 11

Affiliation: Indian Institute of Engineering Science and Technology, Shibpur, Indian Statistical Institute

#Index Pages #Table Of Contents Pages + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Identification and segmentation of the table of contents (TOC) and index pages for the development of a digital library is an obvious task. A digital document library is created to provide a non-labour intensive, cheap and flexible way of storage, representation and management of paper documents in electronic form to facilitate indexing, viewing, printing and extracting the intended portions. Using document image analysis techniques information from the TOC and index pages may be extracted to use in a document database for effective retrieval of the required pieces of information. In this paper, we present fully automatic identification and segmentation of TOC and index pages from scanned documents.

Full Text