Dual-Prior Augmented Decoding Network for Long Tail Distribution in HOI Detection

Jiayi Gao, Wei Chen, Kongming Liang, Jun Guo, Tao Wei, Zhanyu Ma

doi:10.1609/aaai.v38i3.27949

Abstract

Human-object interaction (HOI) detection aims to localize human-object pairs and recognize their interactions. Because the data follow a long-tailed distribution, existing HOI detection methods often have difficulty recognizing the tail categories. Many approaches try to improve HOI recognition by utilizing external knowledge (e.g., pre-trained visual-language models). However, these approaches mainly apply external knowledge at the level of HOI combinations and achieve limited improvement on the tail categories. In this paper, we propose a dual-prior augmented decoding network that decomposes the HOI task into two sub-tasks: human-object pair detection and interaction recognition. For each sub-task, we leverage external knowledge to enhance the model's ability at a finer granularity. Specifically, we acquire prior candidates from an external classifier and embed them to assist the subsequent decoding process. Thus, the long-tail problem is mitigated in a coarse-to-fine manner with the corresponding external knowledge. Our approach outperforms existing state-of-the-art models in various settings and significantly boosts performance on the tail HOI categories. The source code is available at https://github.com/PRIS-CV/DP-ADN.
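The abstract does not say how the prior candidates are selected or embedded. The toy sketch below shows only one possible reading, under stated assumptions: the external classifier returns one score per label, the top-k labels are kept as prior candidates, their embeddings are averaged with softmax weights, and the result is added to a decoder query. All function names, the additive fusion, and the data are hypothetical and are not taken from the DP-ADN code.

```python
import math


def top_k_candidates(scores, k):
    """Return the indices of the k highest-scoring labels (the prior candidates)."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def prior_embedding(scores, label_embeddings, k):
    """Confidence-weighted average of the embeddings of the top-k candidates."""
    idx = top_k_candidates(scores, k)
    weights = softmax([scores[i] for i in idx])
    dim = len(label_embeddings[0])
    return [
        sum(w * label_embeddings[i][d] for w, i in zip(weights, idx))
        for d in range(dim)
    ]


def augment_query(query, prior):
    """Fuse the prior into a decoder query by addition (an assumed fusion choice)."""
    return [q + p for q, p in zip(query, prior)]


# Hypothetical toy data: four interaction labels with 2-D embeddings.
verb_scores = [0.1, 2.0, 0.3, 1.0]          # assumed external classifier scores
verb_embeddings = [[1.0, 1.0], [0.0, 1.0], [1.0, 1.0], [1.0, 0.0]]

candidates = top_k_candidates(verb_scores, k=2)
prior = prior_embedding(verb_scores, verb_embeddings, k=2)
query = augment_query([0.5, 0.5], prior)
```

In this reading, the same routine could be applied twice, once with object-level scores for pair detection and once with interaction-level scores for interaction recognition, which is how the "dual" priors are interpreted here.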


More From: Proceedings of the AAAI Conference on Artificial Intelligence


Similar Papers

EPK-CLIP: External and Priori Knowledge CLIP for action recognition
Zhaoqilin Yang ... Fengjuan Wang
Expert Systems With Applications | VOL. 252
10 May 2024

Enhancing Multiple-Choice Question Answering with Causal Knowledge
Dhairya Dalal ... Paul Buitelaar
01 Jan 2020

Human object interaction detection in paintings using multi-task learning
Maya Antoun ... Daniel Asmar
Digital Applications in Archaeology and Cultural Heritage | VOL. 34
24 Jul 2024

End-to-End Human Object Interaction Detection with HOI Transformer
Cheng Zou ... Jian Sun
01 Jun 2021
