Abstract
Gaze behavior is a non-invasive source of information that plays an important role in many fields, including skills transfer, psychology, and human–computer interaction. Recently, improving the performance of appearance-based gaze estimation with deep learning techniques has attracted increasing attention; however, several key problems in these deep-learning-based gaze estimation methods remain. First, the feature fusion stage is not fully considered: existing methods simply concatenate the extracted features into a single feature, ignoring their internal relationships. Second, dynamic features are difficult to learn, because the extraction process for ambiguously defined dynamic features is unstable. In this study, we propose a novel method that addresses both the feature fusion and dynamic feature extraction problems. We propose the static transformer module (STM), which uses a multi-head self-attention mechanism to fuse fine-grained eye features and coarse-grained facial features. Additionally, we propose a novel recurrent neural network (RNN) cell, the temporal differential module (TDM), to extract dynamic features. We integrated the STM and the TDM into the static transformer with a temporal differential network (STTDN). We evaluated STTDN on two publicly available datasets (MPIIFaceGaze and Eyediap) and demonstrated the effectiveness of the STM and the TDM. Our results show that the proposed STTDN outperformed state-of-the-art methods, including a 2.9% improvement on Eyediap.
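For illustration, the sketch below shows one way the two modules described above could be realized. It is a minimal PyTorch reconstruction based only on this abstract, not the authors' implementation: the class names, feature dimensions, and the GRU-based difference update inside the TDM are all assumptions.

```python
import torch
import torch.nn as nn

class StaticTransformerModule(nn.Module):
    """Sketch of an STM-style block: fuses fine-grained eye features and
    coarse-grained facial features with multi-head self-attention rather
    than plain concatenation. Dimensions are assumed."""
    def __init__(self, dim: int = 128, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, eye_feats: torch.Tensor, face_feats: torch.Tensor) -> torch.Tensor:
        # Treat each feature vector as a token so attention can model the
        # internal relationship between eye and face features.
        tokens = torch.stack([eye_feats, face_feats], dim=1)   # (B, 2, dim)
        fused, _ = self.attn(tokens, tokens, tokens)
        return self.norm(fused + tokens).mean(dim=1)           # (B, dim)

class TemporalDifferentialModule(nn.Module):
    """Sketch of a TDM-style RNN cell: extracts dynamic features from
    frame-to-frame differences of static features (assumed formulation)."""
    def __init__(self, dim: int = 128):
        super().__init__()
        self.cell = nn.GRUCell(dim, dim)

    def forward(self, static_seq: torch.Tensor) -> torch.Tensor:
        # static_seq: (B, T, dim) static features over T frames.
        h = torch.zeros(static_seq.size(0), static_seq.size(2),
                        device=static_seq.device)
        for t in range(1, static_seq.size(1)):
            diff = static_seq[:, t] - static_seq[:, t - 1]  # temporal difference
            h = self.cell(diff, h)
        return h  # dynamic feature summarizing the sequence
```

Treating each feature vector as an attention token lets the fusion step weight the eye and face cues relative to one another, which is the stated advantage over naive concatenation; likewise, driving the recurrent cell with frame-to-frame differences pins down the otherwise ambiguous dynamic features as explicit temporal changes.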