Integrating Visual and Textual Cues for Query-by-String Word Spotting

David Aldavert,Josep Llados,Marcal Rusinol,Ricardo Toledo

doi:10.1109/icdar.2013.108

Integrating Visual and Textual Cues for Query-by-String Word Spotting

David Aldavert, Josep Llados + Show 2 more

Open Access

https://doi.org/10.1109/icdar.2013.108

Copy DOI

Publication Date: Aug 1, 2013

Citations: 72

#Textual Representation #Word Spotting + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In this paper, we present a word spotting framework that follows the query-by-string paradigm where word images are represented both by textual and visual representations. The textual representation is formulated in terms of character n-grams while the visual one is based on the bag-of-visual-words scheme. These two representations are merged together and projected to a sub-vector space. This transform allows to, given a textual query, retrieve word instances that were only represented by the visual modality. Moreover, this statistical representation can be used together with state-of-the-art indexation structures in order to deal with large-scale scenarios. The proposed method is evaluated using a collection of historical documents outperforming state-of-the-art performances.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.