ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection

Ye Liu,Junsong Yuan,Chang Wen Chen

doi:10.1145/3394171.3413600

Abstract

We consider the problem of Human-Object Interaction (HOI) Detection, which aims to locate and recognize HOI instances in the form of <human, action, object> in images. Most existing works treat HOIs as individual interaction categories, thus can not handle the problem of long-tail distribution and polysemy of action labels. We argue that multi-level consistencies among objects, actions and interactions are strong cues for generating semantic representations of rare or previously unseen HOIs. Leveraging the compositional and relational peculiarities of HOI labels, we propose ConsNet, a knowledge-aware framework that explicitly encodes the relations among objects, actions and interactions into an undirected graph called consistency graph, and exploits Graph Attention Networks (GATs) to propagate knowledge among HOI categories as well as their constituents. Our model takes visual features of candidate human-object pairs and word embeddings of HOI labels as inputs, maps them into visual-semantic joint embedding space and obtains detection results by measuring their similarities. We extensively evaluate our model on the challenging V-COCO and HICO-DET datasets, and results validate that our approach outperforms state-of-the-arts under both fully-supervised and zero-shot settings. Code is available at https://github.com/yeliudev/ConsNet.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Graph-based method for human-object interactions detection
Li-min Xia ... Wei Wu
Journal of Central South University | VOL. 28
Li-min Xia, et. al.Li-min Xia ... Wei Wu
01 Jan 2020
Journal of Central South University | VOL. 28

Effects of Motion-Relevant Knowledge From Unlabeled Video to Human-Object Interaction Detection.
Xue Lin ... Xixia Xu
IEEE transactions on neural networks and learning systems | VOL. 34
Xue Lin, et. al.Xue Lin ... Xixia Xu
01 Sep 2023
IEEE transactions on neural networks and learning systems | VOL. 34

Exploring the synergy between textual identity and visual signals in human-object interaction
Pinzhu An ... Zhi Tan
Image and Vision Computing | VOL. 151
Pinzhu An, et. al.Pinzhu An ... Zhi Tan
02 Sep 2024
Image and Vision Computing | VOL. 151

PPDM: Parallel Point Detection and Matching for Real-Time Human-Object Interaction Detection
Yue Liao ... Fei Wang
-
Yue Liao, et. al.Yue Liao ... Fei Wang
01 Jun 2020
01 Jun 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection

Abstract

Talk to us

Similar Papers