Distance-Aware Occlusion Detection with Focused Attention.

Yang Li, Hao Zhao, Yucheng Tu, Xiaoxue Chen, Guyue Zhou

doi:10.1109/tip.2022.3197984

Abstract

For humans, understanding the relationships between objects using visual signals is intuitive. For artificial intelligence, however, this task remains challenging. Researchers have made significant progress in semantic relationship detection, such as human-object interaction detection and visual relationship detection. We take the study of visual relationships a step further, from semantic to geometric. In particular, we predict relative occlusion and relative distance relationships. Detecting these relationships from a single image is challenging, and enforcing focused attention on task-specific regions plays a critical role in detecting them successfully. In this work, (1) we propose a novel three-decoder architecture as the infrastructure for focused attention; (2) we use the generalized intersection box prediction task to effectively guide our model to focus on occlusion-specific regions; and (3) our model achieves new state-of-the-art performance on distance-aware relationship detection. Specifically, our model increases the distance F1-score from 33.8% to 38.6% and boosts the occlusion F1-score from 34.4% to 41.2%. Our code and data will be publicly available.
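To put the reported numbers in context, the following is a minimal Python sketch of the standard F1-score (the harmonic mean of precision and recall) together with the absolute and relative gains implied by the figures in the abstract. The helper names `f1_score` and `gain` are illustrative assumptions and are not taken from the authors' code; the abstract does not report the underlying precision and recall values, so only the quoted F1-scores are used.

```python
def f1_score(precision: float, recall: float) -> float:
    """Standard F1: harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def gain(old: float, new: float) -> tuple:
    """Return (absolute gain in percentage points, relative gain in percent)."""
    return new - old, 100.0 * (new - old) / old


# F1-scores (in percent) quoted in the abstract: (previous best, this work).
reported = {"distance": (33.8, 38.6), "occlusion": (34.4, 41.2)}

for task, (old, new) in reported.items():
    absolute, relative = gain(old, new)
    print(f"{task}: +{absolute:.1f} points ({relative:.1f}% relative)")
```

Under these figures, the distance F1-score improves by 4.8 points (about 14% relative) and the occlusion F1-score by 6.8 points (about 20% relative).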

Journal: IEEE Transactions on Image Processing
Publication Date: Jan 1, 2022
Citations: 4
License type: CC BY 4.0

