Toward Open-Set Human Object Interaction Detection

Mingrui Wu, Xiaoshuai Sun, Rongrong Ji, Jiayi Ji, Yuqi Liu

doi:10.1609/aaai.v38i6.28422

Abstract

This work is oriented toward the task of open-set Human Object Interaction (HOI) detection. The challenge lies in identifying completely new, out-of-domain relationships, as opposed to in-domain ones which have seen improvements in zero-shot HOI detection. To address this challenge, we introduce a simple Disentangled HOI Detection (DHD) model for detecting novel relationships by integrating an open-set object detector with a Visual Language Model (VLM). We utilize a disentangled image-text contrastive learning metric for training and connect the bottom-up visual features to text embeddings through lightweight unary and pair-wise adapters. Our model can benefit from the open-set object detector and the VLM to detect novel action categories and combine actions with novel object categories. We further present the VG-HOI dataset, a comprehensive benchmark with over 17k HOI relationships for open-set scenarios. Experimental results show that our model can detect unknown action classes and combine unknown object classes. Furthermore, it can generalize to over 17k HOI classes while being trained on just 600 HOI classes.
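As a rough illustration of why scoring actions and objects separately lets a model cover far more HOI classes than it was trained on, the sketch below matches visual features against action and object text embeddings independently and multiplies the two probabilities. It is not the authors' implementation: the function names, the temperature value, and the three-dimensional toy vectors are all hypothetical stand-ins for adapter outputs and VLM text embeddings.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def softmax(scores, temperature=0.1):
    """Temperature-scaled softmax (temperature is an assumed value)."""
    peak = max(scores)
    exps = [math.exp((s - peak) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def score_hoi(pair_feature, object_feature, action_text, object_text):
    """Score every (action, object) combination from two separate matches.

    pair_feature   : hypothetical human-object pair feature (action branch)
    object_feature : hypothetical object feature (object branch)
    action_text    : dict of action name -> text embedding
    object_text    : dict of object name -> text embedding
    """
    actions = list(action_text)
    objects = list(object_text)
    p_action = softmax([cosine(pair_feature, action_text[a]) for a in actions])
    p_object = softmax([cosine(object_feature, object_text[o]) for o in objects])
    # Disentangled scoring: A actions and O objects yield A * O HOI classes.
    return {
        (a, o): pa * po
        for a, pa in zip(actions, p_action)
        for o, po in zip(objects, p_object)
    }

# Toy embeddings, invented for illustration only.
action_text = {"ride": [1.0, 0.0, 0.0], "hold": [0.0, 1.0, 0.0]}
object_text = {"horse": [0.0, 0.0, 1.0], "cup": [1.0, 1.0, 0.0]}
pair_feature = [0.9, 0.1, 0.0]
object_feature = [0.1, 0.0, 0.9]

scores = score_hoi(pair_feature, object_feature, action_text, object_text)
best = max(scores, key=scores.get)
print(best)
```

Because the two vocabularies are matched independently, adding a new action or object name only requires a new text embedding, and every combination with the existing names becomes scorable without retraining on that combination.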

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Similar Papers

Human object interaction detection in paintings using multi-task learning
Maya Antoun ... Daniel Asmar
Digital Applications in Archaeology and Cultural Heritage | VOL. 34
24 Jul 2024

An Optimization Model for Human-Object Interaction Detection Inspired by Multi-features
Hailan Kuang ... Jian Dong
01 Apr 2019

Detecting Human-Object Interaction via Fabricated Compositional Learning
Zhi Hou ... Baosheng Yu
01 Jun 2021

End-to-End Human Object Interaction Detection with HOI Transformer
Cheng Zou ... Jian Sun
01 Jun 2021
