Con-Text: Text Detection for Fine-Grained Object Classification.

Sezer Karaoglu,Theo Gevers,Ran Tao,Jan C Van Gemert

doi:10.1109/tip.2017.2707805

Abstract

This paper focuses on fine-grained object classification using recognized scene text in natural images. While the state-of-the-art relies on visual cues only, this paper is the first work which proposes to combine textual and visual cues. Another novelty is the textual cue extraction. Unlike the state-of-the-art text detection methods, we focus more on the background instead of text regions. Once text regions are detected, they are further processed by two methods to perform text recognition, i.e., ABBYY commercial OCR engine and a state-of-the-art character recognition algorithm. Then, to perform textual cue encoding, bi- and trigrams are formed between the recognized characters by considering the proposed spatial pairwise constraints. Finally, extracted visual and textual cues are combined for fine-grained classification. The proposed method is validated on four publicly available data sets: ICDAR03, ICDAR13, Con-Text, and Flickr-logo. We improve the state-of-the-art end-to-end character recognition by a large margin of 15% on ICDAR03. We show that textual cues are useful in addition to visual cues for fine-grained classification. We show that textual cues are also useful for logo retrieval. Adding textual cues outperforms visual- and textual-only in fine-grained classification (70.7% to 60.3%) and logo retrieval (57.4% to 54.8%).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Con-Text: Text Detection for Fine-Grained Object Classification.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing

Lead the way for us

Journal: IEEE Transactions on Image Processing	Publication Date: May 24, 2017
Citations: 40

Similar Papers

Words Matter: Scene Text for Image Classification and Retrieval
Sezer Karaoglu ... Theo Gevers
IEEE Transactions on Multimedia | VOL. 19
Sezer Karaoglu, et. al.Sezer Karaoglu ... Theo Gevers
01 May 2017
IEEE Transactions on Multimedia | VOL. 19

Textual Primacy Online: Impression Formation Based on Textual and Visual Cues in Facebook Profiles
Ayellet Pelled ... Tanya Zilberstein
American Behavioral Scientist | VOL. 61
Ayellet Pelled, et. al.Ayellet Pelled ... Tanya Zilberstein
01 Jun 2017
American Behavioral Scientist | VOL. 61

Impacts of Cues on Learning and Attention in Immersive 360-Degree Video: An Eye-Tracking Study.
Rui Liu ... Xiang Xu
Frontiers in Psychology | VOL. 12
Rui Liu, et. al.Rui Liu ... Xiang Xu
27 Jan 2022
Frontiers in Psychology | VOL. 12

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification
Xiang Bai ... Mingkun Yang
IEEE Access | VOL. 6
Xiang Bai, et. al.Xiang Bai ... Mingkun Yang
01 Jan 2018
IEEE Access | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Con-Text: Text Detection for Fine-Grained Object Classification.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing