Parallel disentangling network for human–object interaction detection

Yamin Cheng,Hancong Duan,Chen Wang,Zhijun Chen

doi:10.1016/j.patcog.2023.110021

Abstract

Human–object interaction (HOI) detection aims to localize and classify triplets of human, object and interaction from a given image. Earlier two-stage methods suffer both from mutually independent training processes and the interference of redundant negative human–object pairs. Prevailing one-stage transformer-based methods are free from the above problems by tackling HOI in an end-to-end manner. However, one-stage transformer-based methods carry the unnecessary entanglements of the query for different tasks, i.e., human–object detection and interaction classification, and thus bring in poor performance. In this paper, we propose a new transformer-based approach that parallelly disentangles human–object detection and interaction classification in a triplet-wise manner. To make each query focus on one specific task clearly, we exhaustively disentangle HOI by parallelly expanding the naive query in vanilla transformer as triple explicit queries. Then, we introduce a semantic communication layer to preserve the consistent semantic association of each HOI through mixing the feature representations of each query triplet of the correspondence constraint. Extensive experiments demonstrate that our proposed framework outperforms the existing methods and achieves the state-of-the-art performance, with significant reduction in parameters and FLOPs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parallel disentangling network for human–object interaction detection

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Oct 4, 2023
Citations: 3

Similar Papers

Pairwise CNN-Transformer Features for Human-Object Interaction Detection.
Hutuo Quan ... Junkai Li
Entropy | VOL. 26
Hutuo Quan, et. al.Hutuo Quan ... Junkai Li
27 Feb 2024
Entropy | VOL. 26

End-to-End Human Object Interaction Detection with HOI Transformer
Cheng Zou ... Jian Sun
-
Cheng Zou, et. al.Cheng Zou ... Jian Sun
01 Jun 2021
01 Jun 2021

Human object interaction detection in paintings using multi-task learning
Maya Antoun ... Daniel Asmar
Digital Applications in Archaeology and Cultural Heritage | VOL. 34
Maya Antoun, et. al.Maya Antoun ... Daniel Asmar
24 Jul 2024
Digital Applications in Archaeology and Cultural Heritage | VOL. 34

An Optimization Model for Human-Object Interaction Detection Inspired by Multi-features
Hailan Kuang ... Jian Dong
-
Hailan Kuang, et. al.Hailan Kuang ... Jian Dong
01 Apr 2019
01 Apr 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallel disentangling network for human–object interaction detection

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition