Abstract

As an essential nonverbal cue, gaze reveals human intentions and plays a crucial role in daily activities. Automatically detecting a person's gaze target has therefore drawn the interest of the computer vision community. This is useful not only for identifying whether children are attentive in class but also for locating items of interest to shoppers in retail settings. Existing gaze-following methods have explored and exploited only the scene context and head cues. Considering the significance of human-object interaction in understanding human intentions, we present the Visual-Spatial Graph and introduce a graph attention network to analyze the interaction probability between the human and the elements in the scene. The interaction probability, inferred from visual-spatial information aggregated by the attention mechanism, is then transformed into an interactive attention map that depicts the areas the person attends to. In addition, we construct a transformer encoder to integrate the features extracted by the scene and head pathways, aiming to decode the gaze target. With the introduction of interactive attention, our proposed method achieves outstanding performance on two benchmarks: GazeFollow and VideoAttentionTarget. Our code is available at https://github.com/nkuhzx/VSG-IA.
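To make the graph-attention idea concrete, the sketch below shows one way a human node could attend over scene-element nodes to yield interaction probabilities. It is a minimal illustration, not the authors' implementation (see the linked repository for that): the feature dimensions, the shared linear projection, the tanh-based pairwise scorer, and the softmax readout are all assumptions chosen for brevity.

```python
# Minimal sketch of attention over a visual-spatial graph, assuming a single
# human node and N scene-element nodes. Illustrative only; the actual model
# lives at https://github.com/nkuhzx/VSG-IA.
import torch
import torch.nn as nn
import torch.nn.functional as F

class VisualSpatialGraphAttention(nn.Module):
    """Scores how likely the human node interacts with each scene-element node."""
    def __init__(self, in_dim: int, hid_dim: int):
        super().__init__()
        self.proj = nn.Linear(in_dim, hid_dim, bias=False)  # shared node projection
        self.attn = nn.Linear(2 * hid_dim, 1, bias=False)   # pairwise attention scorer

    def forward(self, human_feat: torch.Tensor, elem_feats: torch.Tensor) -> torch.Tensor:
        # human_feat: (B, in_dim) visual-spatial feature of the person
        # elem_feats: (B, N, in_dim) features of N scene elements
        h = self.proj(human_feat).unsqueeze(1)           # (B, 1, hid_dim)
        e = self.proj(elem_feats)                        # (B, N, hid_dim)
        pair = torch.cat([h.expand_as(e), e], dim=-1)    # (B, N, 2*hid_dim)
        logits = self.attn(torch.tanh(pair)).squeeze(-1) # (B, N) raw scores
        # Normalized interaction probabilities over scene elements; these
        # could weight each element's region to form the interactive attention map.
        return F.softmax(logits, dim=-1)

# Toy usage: batch of 2, five scene elements, 256-d node features.
probs = VisualSpatialGraphAttention(256, 128)(torch.randn(2, 256), torch.randn(2, 5, 256))
print(probs.shape)  # torch.Size([2, 5]); each row sums to 1
```

In this reading, projecting the interaction probabilities back onto each element's spatial extent would produce the interactive attention map that the decoder consumes alongside the scene and head pathway features.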
