Words Matter: Scene Text for Image Classification and Retrieval

Sezer Karaoglu, Ran Tao, Arnold W M Smeulders, Theo Gevers

doi:10.1109/tmm.2016.2638622

Abstract

Text in natural images typically adds meaning to an object or scene. In particular, text specifies which business places serve drinks (e.g., cafe, teahouse) or food (e.g., restaurant, pizzeria), and what kind of service is provided (e.g., massage, repair). The mere presence of text, its words, and their meaning are closely related to the semantics of the object or scene. This paper exploits textual content in images for fine-grained business place classification and logo retrieval. There are four main contributions. First, we show that the textual cues extracted by the proposed method are effective for the two tasks. Combining the proposed textual and visual cues outperforms visual-only classification and retrieval by a large margin. Second, to extract the textual cues, a generic and fully unsupervised word box proposal method is introduced. The method reaches state-of-the-art word detection recall with a limited number of proposals. Third, contrary to what is widely acknowledged in the text detection literature, we demonstrate that high recall in word detection is more important than high F-score, at least for the two tasks considered in this work. Last, this paper provides a large annotated text detection dataset with 10K images and 27,601 word boxes.
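The distinction the abstract draws between recall and F-score can be made concrete with a small sketch. The code below is not the paper's evaluation protocol; it is a simplified illustration that assumes axis-aligned boxes given as (x_min, y_min, x_max, y_max), a hypothetical IoU threshold of 0.5, and a matching rule where a box counts as covered if any box on the other side overlaps it enough. The function names `iou` and `detection_scores` are illustrative only.

```python
from typing import List, Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max)
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def detection_scores(
    proposals: List[Box],
    ground_truth: List[Box],
    iou_threshold: float = 0.5,  # assumed threshold, not taken from the paper
) -> Tuple[float, float, float]:
    """Return (recall, precision, f_score) under a simplified matching rule."""
    # Recall: fraction of ground-truth word boxes covered by some proposal.
    covered = sum(
        any(iou(g, p) >= iou_threshold for p in proposals) for g in ground_truth
    )
    # Precision: fraction of proposals that cover some ground-truth word box.
    correct = sum(
        any(iou(p, g) >= iou_threshold for g in ground_truth) for p in proposals
    )
    recall = covered / len(ground_truth) if ground_truth else 0.0
    precision = correct / len(proposals) if proposals else 0.0
    total = precision + recall
    f_score = 2 * precision * recall / total if total > 0 else 0.0
    return recall, precision, f_score


# Toy data: two ground-truth word boxes.
truth = [(0, 0, 10, 10), (20, 20, 30, 30)]
# A cautious detector: one proposal, and it is correct.
few = [(0, 0, 10, 10)]
# A generous proposal method: finds both words plus four false alarms.
many = truth + [(50, 50, 60, 60), (70, 70, 80, 80), (90, 90, 100, 100), (40, 0, 50, 10)]

recall_few, precision_few, f_few = detection_scores(few, truth)
recall_many, precision_many, f_many = detection_scores(many, truth)
```

In this toy case the generous proposal set has the higher recall and the lower F-score, which is the trade-off the third contribution refers to: a downstream classifier or retrieval system can ignore spurious boxes, but it cannot recover words that were never proposed.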

Journal: IEEE Transactions on Multimedia
Publication Date: May 1, 2017
Citations: 100

Similar Papers

Con-Text: Text Detection for Fine-Grained Object Classification
Sezer Karaoglu ... Jan C Van Gemert
IEEE Transactions on Image Processing | VOL. 26
24 May 2017

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification
Xiang Bai ... Mingkun Yang
IEEE Access | VOL. 6
01 Jan 2018

Detecting of Vertically-Oriented Texts in Images Containing Natural Scenes
Yi Ling Ong ... Almon Chai
07 Dec 2020

Learning and Fusing Multi-Scale Representations for Accurate Arbitrary-Shaped Scene Text Recognition
Mingjun Li ... Feng Su
12 Jun 2023
