Complex documents images segmentation based on steerable pyramid features

Mohamed Benjelil, Adel M Alimi, Rémy Mullot, Slim Kanoun

doi:10.1007/s10032-010-0113-9

Abstract

Page segmentation and classification are essential steps in a document layout analysis system, performed before a page is passed to an OCR system or to any other subsequent processing step. In this paper, we propose an accurate system designed for complex document segmentation, based on the steerable pyramid transform. Features extracted from the pyramid sub-bands are used to locate regions and classify them as text (machine-printed or handwritten) or non-text (images, graphics, drawings or paintings) in noisy, deformed, multilingual, multi-script document images. These documents contain tabular structures, logos, stamps, handwritten script blocks, photographs, etc. We present encouraging results obtained on a data set of 1,000 official complex document images and compare them with those of existing state-of-the-art methods. The comparison shows that the proposed method performs consistently well on large sets of complex document images.
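
The abstract does not give implementation details, so the following is only a minimal sketch of the general idea of multi-scale, multi-orientation sub-band features. It is an assumption-laden simplification and not the authors' method: it steers first-derivative-of-Gaussian filters over a plain Gaussian pyramid rather than building a full steerable pyramid, and the names `gaussian_kernel`, `conv_sep` and `oriented_energy_features` are hypothetical.

```python
import numpy as np

def gaussian_kernel(sigma=1.0, radius=3):
    """Return sample positions x and a normalized 1-D Gaussian g."""
    x = np.arange(-radius, radius + 1, dtype=float)
    g = np.exp(-x**2 / (2 * sigma**2))
    return x, g / g.sum()

def conv_sep(img, kx, ky):
    """Separable same-size convolution with edge padding.
    kx is applied along rows (x direction), ky along columns (y direction)."""
    r = len(kx) // 2
    padded = np.pad(img, r, mode="edge")
    tmp = np.apply_along_axis(lambda v: np.convolve(v, kx, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda v: np.convolve(v, ky, mode="valid"), 0, tmp)

def oriented_energy_features(img, levels=3, n_orient=4):
    """Mean squared response per (pyramid level, orientation).
    Assumption: a derivative-of-Gaussian basis stands in for the
    steerable pyramid sub-bands mentioned in the abstract."""
    x, g = gaussian_kernel()
    d = -x * g  # derivative of the Gaussian for sigma = 1
    cur = np.asarray(img, dtype=float)
    feats = []
    for _ in range(levels):
        gx = conv_sep(cur, d, g)  # basis filter response along x
        gy = conv_sep(cur, g, d)  # basis filter response along y
        for k in range(n_orient):
            theta = np.pi * k / n_orient
            # Steerability: any orientation is a linear mix of the two basis responses.
            band = np.cos(theta) * gx + np.sin(theta) * gy
            feats.append(np.mean(band**2))
        # Blur and subsample by 2 to move to the next, coarser level.
        cur = conv_sep(cur, g, g)[::2, ::2]
    return np.array(feats).reshape(levels, n_orient)

# Usage: vertical stripes vary only along x, so the theta = 0 band dominates.
stripes = np.tile(((np.arange(64) // 4) % 2).astype(float), (64, 1))
features = oriented_energy_features(stripes)
print(features.shape)
```

In a segmentation setting, such feature vectors would typically be computed per block or region and passed to a classifier that separates text from non-text; the abstract does not say which classifier the authors use.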

Journal: International Journal on Document Analysis and Recognition (IJDAR)
Publication Date: Mar 10, 2010
Citations: 55

Similar Papers

Page Segmentation Based on Steerable Pyramid Features
Mohamed Benjelil ... Remy Mullot
01 Sep 2012

Steerable Pyramid Based Complex Documents Images Segmentation
Mohamed Benjelil ... Adel M Alimi
01 Jan 2009

A Unified Algorithm for Identification of Various Tabular Structures from Document Images
Sekhar Mandal ... Partha Bhowmick
International Journal of Digital Library Systems | VOL. 2
01 Jan 2010

Page Segmentation using XY Cut Algorithm in OCR Systems - A Review
Sukhvir Kaur ... Sukhwinder Kaur
INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY | VOL. 6
30 Dec 2008
