Abstract

The direction of human gaze is an important indicator of human behavior, reflecting the level of attention and cognitive state toward visual stimuli in the environment. Convolutional neural networks have achieved good performance in gaze estimation, but their limited global modeling capability makes it difficult to improve prediction accuracy further. In recent years, transformer models have been introduced for gaze estimation and have achieved state-of-the-art performance. However, their slicing-and-mapping mechanism for processing local image patches can compromise local spatial information. Moreover, a single down-sampling rate and fixed-size tokens are not well suited to the multiscale feature learning required by gaze estimation. To overcome these limitations, this study introduces the Swin Transformer for gaze estimation and designs two network architectures: a pure Swin Transformer gaze estimation model (SwinT-GE) and a hybrid model that combines convolutional structures with SwinT-GE (Res-Swin-GE). SwinT-GE uses the tiny version of the Swin Transformer for gaze estimation. Res-Swin-GE replaces the slicing-and-mapping mechanism of SwinT-GE with convolutional structures. Experimental results demonstrate that Res-Swin-GE significantly outperforms SwinT-GE, is strongly competitive on the MPIIFaceGaze dataset, and achieves a 7.5% improvement over existing state-of-the-art methods on the EYEDIAP dataset.
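To make the architectural idea concrete, the following is a minimal PyTorch sketch of the hybrid design described above: a small convolutional stem replaces the 4x4 patch partition and linear embedding ("slicing-and-mapping") stage, and the network regresses a 2-D gaze direction (pitch, yaw). The module names, stem depth, and the use of a generic nn.TransformerEncoder as a stand-in for the Swin-Tiny stages are illustrative assumptions, not the authors' exact implementation.

```python
import torch
import torch.nn as nn


class ConvStem(nn.Module):
    """Convolutional stem: downsample 4x and embed to `dim` channels,
    replacing the 4x4 patch partition + linear projection of Swin."""
    def __init__(self, in_ch=3, dim=96):
        super().__init__()
        self.stem = nn.Sequential(
            nn.Conv2d(in_ch, dim // 2, kernel_size=3, stride=2, padding=1),
            nn.BatchNorm2d(dim // 2),
            nn.ReLU(inplace=True),
            nn.Conv2d(dim // 2, dim, kernel_size=3, stride=2, padding=1),
            nn.BatchNorm2d(dim),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):                      # x: (B, 3, H, W)
        x = self.stem(x)                       # (B, dim, H/4, W/4)
        return x.flatten(2).transpose(1, 2)    # (B, H/4 * W/4, dim) tokens


class ResSwinGESketch(nn.Module):
    """Conv stem -> transformer encoder (placeholder for the Swin-Tiny
    stages) -> token pooling -> 2-D gaze regression head."""
    def __init__(self, dim=96, depth=4, heads=3):
        super().__init__()
        self.stem = ConvStem(dim=dim)
        layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=4 * dim,
            batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.head = nn.Linear(dim, 2)          # (pitch, yaw) gaze angles

    def forward(self, x):
        tokens = self.stem(x)                  # convolutional tokenization
        tokens = self.encoder(tokens)          # global token mixing
        return self.head(tokens.mean(dim=1))   # average over all tokens


if __name__ == "__main__":
    model = ResSwinGESketch()
    gaze = model(torch.randn(2, 3, 224, 224))  # batch of face crops
    print(gaze.shape)                          # torch.Size([2, 2])
```

In this sketch, the convolutional stem preserves local spatial structure that a flat patch projection would discard, which is the motivation given for Res-Swin-GE; swapping the placeholder encoder for an actual Swin-Tiny backbone would recover the hierarchical, multiscale behavior discussed in the abstract.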
