Text extraction from graphical document images using sparse representation

Thai V Hoang,Salvatore Tabbone

doi:10.1145/1815330.1815349

Abstract

A novel text extraction method from graphical document images is presented in this paper. Graphical document images containing text and graphics components are considered as two-dimensional signals by which text and graphics have different morphological characteristics. The proposed algorithm relies upon a sparse representation framework with two appropriately chosen discriminative overcomplete dictionaries, each one gives sparse representation over one type of signal and non-sparse representation over the other. Separation of text and graphics components is obtained by promoting sparse representation of input images in these two dictionaries. Some heuristic rules are used for grouping text components into text strings in post-processing steps. The proposed method overcomes the problem of touching between text and graphics. Preliminary experiments show some promising results on different types of document.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Text extraction from graphical document images using sparse representation

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jun 9, 2010
Citations: 62	License type: other-oa

Similar Papers

Computationally Efficient Formulation of Sparse Color Image Recovery in the JPEG Compressed Domain
Camelia Florea ... Aurel Vlaicu
Journal of Mathematical Imaging and Vision | VOL. 49
Camelia Florea, et. al.Camelia Florea ... Aurel Vlaicu
10 Jul 2013
Journal of Mathematical Imaging and Vision | VOL. 49

Improved Sparse Representation of Image from Inferred Angles of Steerable Wavelet
Jignesh Bhavsar ... Kalpesh Patel
Indian Journal Of Science And Technology | VOL. 17
Jignesh Bhavsar, et. al.Jignesh Bhavsar ... Kalpesh Patel
31 Jul 2024
Indian Journal Of Science And Technology | VOL. 17

Subspace Image Representation for Facial Expression Analysis and Face Recognition and its Relation to the Human Visual System
Ioan Buciu ... Ioannis Pitas
-
Ioan Buciu, et. al.Ioan Buciu ... Ioannis Pitas
01 Jan 2009
01 Jan 2009

An embedded system for face classification in infrared video using sparse representation
Miguel Figueroa ... Payman Zarkesh-Ha
-
Miguel Figueroa, et. al.Miguel Figueroa ... Payman Zarkesh-Ha
19 Sep 2017
19 Sep 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Text extraction from graphical document images using sparse representation

Abstract

Talk to us

Similar Papers