Abstract

Automatically detecting human-object interactions (HOIs) from an image is an important but challenging task in computer vision. A significant difficulty in HOI detection is that similar human-object interactions are hard to distinguish. Recently, many instance-centric HOI detection schemes, based on appearance features and coarse spatial information, have been proposed. These methods, however, lack the capacity to capture and analyze the fine-grained context between human poses and object parts, which plays a crucial role in HOI detection. To address these problems, we propose a novel instance part-level attention deep framework for HOI detection. Specifically, our approach consists of a human/object-part detection phase and an HOI detection phase. In the former phase, a part-level visual pattern estimation model is designed to capture fine-grained human body parts and object parts. In the latter phase, a self-attention-based deep network is proposed to learn the visual composite around the human-object pair, which implicitly encodes the consistent spatial, scale, co-occurrence, and viewpoint relationships among human body parts and object parts across images; these relationships are effective cues for HOI prediction. To the best of our knowledge, we are the first to propose a framework in which the fine-grained part-level mutual context of a human-object pair is extracted to improve HOI detection. Experiments comparing our approach with state-of-the-art HOI detection methods on benchmark datasets demonstrate that our framework outperforms existing methods and confirm the effectiveness of the part-level visual pattern estimation model and the self-attention-based deep network structure for HOI detection.
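
As a rough illustration of the HOI detection phase described above, the sketch below shows how self-attention over pooled human body-part and object-part features of a single human-object pair could produce HOI scores. This is a minimal sketch under assumptions: the PyTorch framework, the module name PartLevelHOIHead, all dimensions, and the mean-pooling step are hypothetical and are not taken from the paper.

```python
import torch
import torch.nn as nn


class PartLevelHOIHead(nn.Module):
    """Illustrative sketch: self-attention over human body-part and object-part
    features of one human-object pair, followed by an HOI classifier.
    All names and dimensions are assumptions for illustration only."""

    def __init__(self, part_dim=256, num_heads=4, num_hoi_classes=600):
        super().__init__()
        # Self-attention lets every part feature attend to every other part,
        # capturing spatial, scale, and co-occurrence context among parts.
        self.attn = nn.MultiheadAttention(part_dim, num_heads, batch_first=True)
        self.classifier = nn.Sequential(
            nn.LayerNorm(part_dim),
            nn.Linear(part_dim, num_hoi_classes),
        )

    def forward(self, part_feats):
        # part_feats: (batch, num_parts, part_dim) -- pooled features of the
        # detected human body parts and object parts for each H-O pair.
        ctx, _ = self.attn(part_feats, part_feats, part_feats)
        # Aggregate the contextualized part features and predict HOI scores.
        pooled = ctx.mean(dim=1)
        return self.classifier(pooled)


if __name__ == "__main__":
    head = PartLevelHOIHead()
    dummy_parts = torch.randn(2, 12, 256)  # 2 pairs, 12 part features each
    print(head(dummy_parts).shape)         # torch.Size([2, 600])
```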
