Visual Relation Detection with Multi-Level Attention

Sipeng Zheng,Qin Jin,Shizhe Chen

doi:10.1145/3343031.3350962

Abstract

Visual relations, which describe various types of interactions between two objects in the image, can provide critical information for comprehensive semantic understanding of the image. Multiple cues related to the objects can contribute to visual relation detection, which mainly include appearances, spacial locations and semantic meanings. It is of great importance to represent different cues and combine them effectively for visual relation detection. However, in previous works, the appearance representation is simply realized by global visual representation based on the bounding boxes of objects, which may not capture salient regions of the interaction between two objects, and the different cue representations are equally concatenated without considering their different contributions for different relations. In this work, we propose a multi-level attention visual relation detection model (MLA-VRD), which generates salient appearance representation via a multi-stage appearance attention strategy and adaptively combine different cues with different importance weighting via a multi-cue attention strategy. Extensive experiment results on two widely used visual relation detection datasets, VRD and Visual Genome, demonstrate the effectiveness of our proposed model which significantly outperforms the previous state-of-the-arts. Our proposed model also achieves superior performance under the zero-shot learning condition, which is an important ordeal for testing the generalization ability of visual relation detection models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Visual Relation Detection with Multi-Level Attention

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Adaptive depth-aware visual relationship detection
Ming-Gang Gan ... Yuxuan He
Knowledge-Based Systems | VOL. 247
Ming-Gang Gan, et. al.Ming-Gang Gan ... Yuxuan He
18 Apr 2022
Knowledge-Based Systems | VOL. 247

Iterative Visual Relationship Detection via Commonsense Knowledge Graph
Hai Wan ... Jeff Z Pan
-
Hai Wan, et. al.Hai Wan ... Jeff Z Pan
01 Jan 2020
01 Jan 2020

Visual Relation Detection using Hybrid Analogical Learning
Kezhen Chen ... Ken Forbus
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Kezhen Chen, et. al.Kezhen Chen ... Ken Forbus
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

Visual Relationship Detection With Image Position and Feature Information Embedding and Fusion
Jinghui Peng ... Ying Zhang
IEEE Access | VOL. 10
Jinghui Peng, et. al.Jinghui Peng ... Ying Zhang
01 Jan 2021
IEEE Access | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Visual Relation Detection with Multi-Level Attention

Abstract

Talk to us

Similar Papers