Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms

Oleg Okun,Matti Pietikäinen

doi:10.1155/asp/2006/12093

Oleg Okun, Matti Pietikäinen

Open Access

PDF Available

https://doi.org/10.1155/asp/2006/12093

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Many image segmentation algorithms are known, but often there is an inherent obstacle in the unbiased evaluation of segmentation quality: the absence or lack of a common objective representation for segmentation results. Such a representation, known as the ground truth, is a description of what one should obtain as the result of ideal segmentation, independently of the segmentation algorithm used. The creation of ground truth is a laborious process and therefore any degree of automation is always welcome. Document image analysis is one of the areas where ground truths are employed. In this paper, we describe an automated tool called GROTTO intended to generate ground truths for skewed document images, which can be used for the performance evaluation of page segmentation algorithms. Some of these algorithms are claimed to be insensitive to skew (tilt of text lines). However, this fact is usually supported only by a visual comparison of what one obtains and what one should obtain since ground truths are mostly available for upright images, that is, those without skew. As a result, the evaluation is both subjective; that is, prone to errors, and tedious. Our tool allows users to quickly and easily produce many sufficiently accurate ground truths that can be employed in practice and therefore it facilitates automatic performance evaluation. The main idea is to utilize the ground truths available for upright images and the concept of the representative square [9] in order to produce the ground truths for skewed images. The usefulness of our tool is demonstrated through a number of experiments with real-document images of complex layout.

Highlights

Segmentation is an important step in image analysis since it detects homogeneous regions whose characteristics can be computed and analyzed, for example, for discriminating between different classes of objects such as faces and nonfaces
The unbiased evaluation of segmentation results is difficult because it requires an ideal description of what one should obtain as the result of segmentation of a certain image regardless of the segmentation algorithm
As one can see from the brief discussion of various ground truthing strategies, one of the first tasks is to choose a proper representation for page regions

Summary

INTRODUCTION

Segmentation is an important step in image analysis since it detects homogeneous regions whose characteristics can be computed and analyzed, for example, for discriminating between different classes of objects such as faces and nonfaces. The unbiased evaluation of segmentation results is difficult because it requires an ideal description of what one should obtain as the result of segmentation of a certain image regardless of the segmentation algorithm This ideal description, known as the ground truth, can be utilized for judging whether segmentation is correct or not, and how well a given image is segmented. The second alternative is automated and more attractive when it is necessary to process a large number of images It uses a special (usually text) file, called a ground truth (GT), for each image, containing a description of different regions that should be detected during correct segmentation. Modern approaches to ground truth generation are briefly reviewed

BRIEF OVERVIEW OF GROUND TRUTHING STRATEGIES

OUR APPROACH TO THE PROBLEM

CONCEPT OF THE REPRESENTATIVE SQUARE

SGT GENERATION METHOD

GROTTO

Mode 1

Mode 2

Mode 3: verification of GT generation

EXPERIMENTS

Findings

DISCUSSION AND CONCLUSION

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Advances in Signal Processing	Publication Date: Mar 12, 2006
Citations: 2	License type: cc-by

R Discovery Prime

Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing

Lead the way for us

Similar Papers

Phantom-based performance evaluation: Application to brain segmentation from magnetic resonance images
Bruno Moretti ... Bernard Mazoyer
Medical Image Analysis | VOL. 4
Bruno Moretti, et. al.Bruno Moretti ... Bernard Mazoyer
28 Nov 2000
Medical Image Analysis | VOL. 4

Ground Truth for Layout Analysis Performance Evaluation
A Antonacopoulos ... D Bridson
-
A Antonacopoulos, et. al.A Antonacopoulos ... D Bridson
01 Jan 2006
01 Jan 2006

Efficient Transcript Mapping to Ease the Creation of Document Image Segmentation Ground Truth with Text-Image Alignment
Nikolaos Stamatopoulos ... Georgios Louloudis
-
Nikolaos Stamatopoulos, et. al.Nikolaos Stamatopoulos ... Georgios Louloudis
01 Nov 2010
01 Nov 2010

Labeled Array Distance Metric for Measuring Image Segmentation Quality
Maryam Berijanian ... Dirk Colbry
ELCVIA Electronic Letters on Computer Vision and Image Analysis | VOL. 23
Maryam Berijanian, et. al.Maryam Berijanian ... Dirk Colbry
12 Nov 2024
ELCVIA Electronic Letters on Computer Vision and Image Analysis | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing