ON THE INFLUENCE OF WORD REPRESENTATIONS FOR HANDWRITTEN WORD SPOTTING IN HISTORICAL DOCUMENTS

Josep Lladós, Marçal Rusiñol, Alicia Fornés, David Fernández, Anjan Dutta

doi:10.1142/s0218001412630025

Abstract

Word spotting is the process of retrieving all instances of a queried keyword from a digital library of document images. In this paper we evaluate the performance of different word descriptors to assess the advantages and disadvantages of statistical and structural models in a framework of query-by-example word spotting in historical documents. We compare four word representation models: sequence alignment using dynamic time warping (DTW) as a baseline reference, a bag-of-visual-words approach as a statistical model, a pseudo-structural model based on a Loci features representation, and a structural approach in which words are represented by graphs. The four approaches have been tested on two collections of historical data: the George Washington database and the marriage records from the Barcelona Cathedral. We experimentally demonstrate that statistical representations generally give better performance. However, it cannot be neglected that large descriptors are difficult to implement in a retrieval scenario where word spotting requires indexing millions of word images.
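
To make the DTW baseline named in the abstract more concrete, the following is a minimal sketch of query-by-example ranking with the textbook dynamic time warping recurrence. It is not the authors' implementation: the abstract does not state which features are extracted from each word image, so the one-dimensional feature sequences, the absolute-difference local cost, and the names `dtw_distance` and `rank_by_dtw` are all assumptions made for illustration.

```python
import math


def dtw_distance(a, b):
    """Textbook DTW cost between two 1-D sequences (assumed feature format)."""
    n, m = len(a), len(b)
    # cost[i][j] holds the best alignment cost of a[:i] against b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])  # assumed local cost
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


def rank_by_dtw(query, candidates):
    """Return candidate names sorted by increasing DTW distance to the query."""
    return sorted(candidates, key=lambda name: dtw_distance(query, candidates[name]))


# Hypothetical toy sequences standing in for per-column word-image features.
query = [1, 2, 3]
candidates = {"stretched_copy": [1, 2, 2, 3], "other_word": [5, 5, 5]}
ranking = rank_by_dtw(query, candidates)
```

In this toy setting the stretched copy of the query ranks first because DTW absorbs the repeated sample at no cost. The quadratic cost per comparison also hints at why the abstract raises indexing concerns for collections with millions of word images.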

Journal: International Journal of Pattern Recognition and Artificial Intelligence
Publication Date: Aug 1, 2012
Citations: 47

Similar Papers

Partial duplicate image retrieval using fast visual word generation technique
S N Bhojane ... P R Futane
01 Sep 2015

A Novel Method for Scene Categorization Using an Improved Visual Vocabulary Approach
Tarek Elguebaly ... Nizar Bouguila
01 Jan 2014

Improving codebook generation for action recognition using a mixture of Asymmetric Gaussians
Tarek Elguebaly ... Nizar Bouguila
01 Dec 2014

Word spotting and recognition via a joint deep embedding of image and text
Mohamed Mhiri ... Mohamed Cheriet
Pattern Recognition | VOL. 88
20 Nov 2018
