Visual phrase recognition by modeling 3D spatial context of multiple objects

Lin Bai, Qingfeng Chen

doi:10.1016/j.neucom.2017.01.100

Abstract

Automatically recognizing the visual phrase in an image is a challenging problem in computer vision. In this paper, we propose a method that discovers and identifies visual phrases by automatically analyzing the 3D spatial geometric structure of an image. The method has two steps: (1) learning a 3D spatial geometric model, and (2) recognizing the visual phrase. For the first step, we propose 3D spatial geometric (3DSG) models that jointly capture the features of the objects and the 3D spatial layout among the objects in a visual phrase. For the second step, we cast visual phrase recognition as verification, measuring the similarity in spatial configuration between a given visual pattern and a 3DSG model. The method determines whether the given visual pattern belongs to a specific 3DSG model by maximizing the joint probability of the pattern and the model. Experiments on several datasets show that our model outperforms state-of-the-art models both in modeling 3D spatial geometric structure and in recognizing visual phrases. The results also demonstrate that modeling the 3D spatial configuration between objects can significantly improve deeper image understanding.
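As a rough illustration of the verification idea in the abstract, the sketch below scores a candidate two-object pattern against a toy model and accepts it when the joint log-probability reaches a threshold. Everything in it is an assumption made for illustration, not the paper's 3DSG formulation: the per-axis Gaussian over the 3D offset between two objects, the independence between appearance scores and layout, the fixed threshold (the abstract speaks of maximizing the joint probability, which is simplified here to a threshold test), and the hypothetical names `ToyPhraseModel`, `joint_log_prob`, and `verify`.

```python
import math
from dataclasses import dataclass


@dataclass
class ToyPhraseModel:
    """Hypothetical stand-in for a learned phrase model (not the paper's 3DSG)."""
    mean_offset: tuple  # expected 3D offset (dx, dy, dz) from object A to object B
    std_offset: tuple   # per-axis standard deviation of that offset


def gaussian_log_pdf(x, mean, std):
    """Log density of a 1D Gaussian with the given mean and standard deviation."""
    return -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)


def joint_log_prob(model, pos_a, pos_b, score_a, score_b):
    """Joint log-probability of appearance and layout, assumed independent."""
    # Appearance term: detector confidences in (0, 1] treated as probabilities.
    log_p = math.log(score_a) + math.log(score_b)
    # Layout term: per-axis Gaussian on the observed 3D offset from A to B.
    for a, b, mean, std in zip(pos_a, pos_b, model.mean_offset, model.std_offset):
        log_p += gaussian_log_pdf(b - a, mean, std)
    return log_p


def verify(model, pos_a, pos_b, score_a, score_b, threshold=-6.0):
    """Accept the pattern if its joint log-probability reaches the threshold."""
    return joint_log_prob(model, pos_a, pos_b, score_a, score_b) >= threshold


# Toy "person riding horse" model: the person (B) sits about 1 unit above the horse (A).
riding = ToyPhraseModel(mean_offset=(0.0, 1.0, 0.0), std_offset=(0.5, 0.5, 0.5))
on_top = verify(riding, (0.0, 0.0, 0.0), (0.0, 1.0, 0.0), 0.9, 0.8)
beside = verify(riding, (0.0, 0.0, 0.0), (3.0, 0.0, 0.0), 0.9, 0.8)
```

With these toy numbers, the configuration that matches the expected layout is accepted and the one where the person stands beside the horse is rejected. A real system would learn the layout distribution and the decision rule from data rather than fixing them by hand.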

Journal: Neurocomputing
Publication Date: Mar 8, 2017
Citations: 6

Similar Papers

Bayes pooling of visual phrases for object retrieval
Wenhui Jiang ... Fei Su
Multimedia Tools and Applications | VOL. 75
30 Sep 2015

An Image Classification Method Based on PLSA and Visual Phrases
Yong Zhang ... Hao Yang
01 Dec 2016

Discovery of Collocation Patterns: from Visual Words to Visual Phrases
Junsong Yuan ... Ying Wu
01 Jun 2007

Descriptive visual words and visual phrases for image applications
Shiliang Zhang ... Shipeng Li
19 Oct 2009
