Abstract

The block segmentation and block of digitized printed documents segmented into region of texts, graphics, tables and images are very important in document analysis and understanding. Conventionally, the Constrained Run Length Algorithm (CRLA) has been proposed to segment digited document, but failure may occur due to improper constraints. Especially, it usually leads to failure about block segmentation when the documents are complicated and inclined. They could only deal with the text part for block without any certain rules, and couldn't succeed in effective and even lead to wrong classification. In this paper, a powerful approach for document analysis named Automatic Local Sequential Segmentation and Hierarchical classification is proposed. Our results show that this algorithm is an efficient approach for block segmentation and block classification.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call