Context-Dependent Diffusion Network for Visual Relationship Detection

Zhen Cui, Jian Yang, Chunyan Xu, Wenming Zheng

doi:10.1145/3240508.3240668

Abstract

Visual relationship detection can bridge the gap between computer vision and natural language for scene understanding of images. Unlike pure object recognition, the subject-predicate-object relation triplets span an extremely diverse space, such as person-behind-person and car-behind-building, and suffer from combinatorial explosion. In this paper, we propose a context-dependent diffusion network (CDDN) framework for visual relationship detection. To capture the interactions among object instances, two types of graphs, a word semantic graph and a visual scene graph, are constructed to encode global context interdependency. The semantic graph is built from language priors to model semantic correlations across objects, while the visual scene graph defines the connections among scene objects so as to exploit surrounding scene information. For this graph-structured data, we design a diffusion network that adaptively aggregates information from contexts; it can effectively learn latent representations of visual relationships and is well suited to visual relationship detection because it is invariant to graph isomorphism. Experiments on two widely used datasets demonstrate that the proposed method is effective and achieves state-of-the-art performance.
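
The abstract does not give the form of the diffusion operator, so the following is only a minimal sketch of the general idea of aggregating context over a graph, not the CDDN layer itself. It assumes a generic multi-hop diffusion, a weighted sum of node features propagated 0 to K hops through a row-normalized adjacency matrix. The names `row_normalize`, `matmul`, `diffuse`, and `hop_weights` are hypothetical, and the hop weights are fixed here, whereas a trained network would learn them.

```python
# Illustrative sketch only: generic multi-hop graph diffusion,
# out = sum_k w_k * P^k * X, with P the row-normalized adjacency.
# This is an assumed formulation, not the operator from the paper.

def row_normalize(adj):
    """Convert an adjacency matrix into a row-stochastic transition matrix."""
    normalized = []
    for row in adj:
        total = sum(row)
        normalized.append([v / total if total else 0.0 for v in row])
    return normalized


def matmul(a, b):
    """Plain-Python matrix product of two list-of-lists matrices."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def diffuse(adj, feats, hop_weights):
    """Aggregate node features over 0..K hops, weighting hop k by hop_weights[k]."""
    transition = row_normalize(adj)
    current = [row[:] for row in feats]          # P^0 X = X
    out = [[0.0] * len(feats[0]) for _ in feats]
    for weight in hop_weights:
        for i, row in enumerate(current):
            for j, value in enumerate(row):
                out[i][j] += weight * value
        current = matmul(transition, current)    # advance one hop
    return out


# Toy scene graph with three objects connected in a chain: 0 - 1 - 2.
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
feats = [[1.0], [0.0], [0.0]]                    # only object 0 carries signal
result = diffuse(adj, feats, hop_weights=[0.5, 0.5])
print(result)
```

In this toy graph, object 1 picks up part of object 0's feature after one hop, while object 2, two hops away, receives nothing with only hops 0 and 1 included. Relabeling the nodes simply permutes the output rows in the same way, which is the kind of structure-respecting behavior that graph operators of this family share.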

Similar Papers

Visual relationship detection with region topology structure
Le Zhang ... Zhenxi Zhang
Information Sciences | VOL. 564
09 Feb 2021

Visual Information Oriented Knowledge Graph
Jinglei Lou ... Yu Cao
20 Oct 2021

Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection
Xiaodan Liang ... Lisa Lee
01 Jul 2017

Iterative Visual Relationship Detection via Commonsense Knowledge Graph
Hai Wan ... Jeff Z Pan
01 Jan 2020
