Improving human–object interaction with auxiliary semantic information and enhanced instance representation

Khang Nguyen, Thinh V Le, Huyen Ngoc N Van, Doanh C Bui

doi:10.1016/j.patrec.2023.09.013

Abstract

Human–Object Interaction (HOI) detection, which involves identifying and describing the actions between humans and objects, has attracted considerable attention in computer vision. Both sequential and end-to-end approaches have been proposed for the problem, with recent work focusing on end-to-end systems. This study presents an enhanced end-to-end, transformer-based HOI detector built on HOTR, with three improvements. The proposed model improves instance representation through a simple yet effective mechanism, uses semantic information to provide contextual understanding and additional knowledge, and incorporates a cross-attention mechanism that fuses high-level feature maps from multiple levels within the Transformer architecture. Experimental results show significant performance gains over the baseline HOTR model, making the proposed model competitive with other state-of-the-art models on two widely used datasets: V-COCO and HICO-DET.
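
The abstract does not specify how the cross-attention fusion is built, so the following is only a generic illustration of the idea, not the authors' implementation. It is a minimal pure-Python sketch of single-head cross-attention in which tokens from one feature level (the queries) attend to tokens from another level (the keys and values), and the result is added back to the queries. The function names, the residual fusion, and the absence of learned projections are all assumptions made for brevity; a real Transformer layer would add learned query/key/value projections, multiple heads, and normalization.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, keys_values):
    """Single-head cross-attention without learned projections (illustrative only).

    queries:     tokens from one feature level, as a list of Nq vectors of size d
    keys_values: tokens from another feature level, as a list of Nk vectors of size d
    Returns Nq fused vectors: each query plus an attention-weighted
    sum of the other level's tokens (a simple residual fusion).
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    fused = []
    for q in queries:
        # Scaled dot-product score between this query and every key.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys_values]
        weights = softmax(scores)
        # Weighted sum of the value vectors (values are the keys themselves here).
        attended = [sum(w * k[j] for w, k in zip(weights, keys_values)) for j in range(d)]
        fused.append([qi + ai for qi, ai in zip(q, attended)])
    return fused

# Toy example: 2 tokens from one level attend to 3 tokens from another level.
fine = [[1.0, 0.0], [0.0, 1.0]]
coarse = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused = cross_attention(fine, coarse)
```

The output keeps the shape of the query level, which is why this kind of fusion can be dropped in between existing layers without changing downstream dimensions.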

Journal: Pattern Recognition Letters	Publication Date: Oct 13, 2023
Citations: 2

Similar Papers

Human object interactions recognition based on social network analysis
Guang Yang ... Hong Man
01 Oct 2013

Toward a Unified Transformer-Based Framework for Scene Graph Generation and Human-Object Interaction Detection
Tao He ... Yuan-Fang Li
IEEE Transactions on Image Processing | VOL. 32
01 Jan 2023

Semantic Inference Network for Human-Object Interaction Detection
Hongyi Liu ... Huimin Ma
01 Jan 2019

SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition
Zhong Ji ... Yanwei Pang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
03 Apr 2020
