A new appearance-based gaze estimation via multi-modal fusion

Huijie Yang,Jiahui Liu,Jiannan Chi,Zuoyun Yang

doi:10.1109/nnice58320.2023.10105698

Abstract

Appearance-based gaze estimation has gained more and more attention because of its generality, robustness, and subject independence. Deep learning, which has made a great deal of success in computer vision, has also greatly improved the accuracy of appearance-based gaze estimation. To further reduce the error in gaze estimation, we focus on extracting better feature information from eye and face images. In this paper, we propose a novel multimodal fusion gaze estimation model based on ConvNext and dilated convolution. In this model, the eye image and face image are used as input, and the ConvNext network is used to extract the features of the face image and the eye features are extracted by a dilated convolution-based network, and the feature map of the two images are fused using the fully connected layer to perform gaze estimation. In the experimental part, the designed model is verified on the public dataset MPIIGaze, and compared the proposed model with other gaze estimation models. The experimental results show that our proposed method has greatly improved the accuracy of gaze estimation on the MPIIGaze dataset compared to other related works. Our proposed multimodal fusion gaze estimation model achieves state-of-the-art result on the MPIIGaze dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A new appearance-based gaze estimation via multi-modal fusion

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Deep Learning Approach to Appearance-Based Gaze Estimation under Head Pose Variations
Hsin-Pei Sun ... Shang-Hong Lai
-
Hsin-Pei Sun, et. al.Hsin-Pei Sun ... Shang-Hong Lai
01 Nov 2017
01 Nov 2017

Multi-feature fusion gaze estimation based on attention mechanism
Zhangfang Hu ... Lan Wang
-
Zhangfang Hu, et. al.Zhangfang Hu ... Lan Wang
09 Oct 2021
09 Oct 2021

Gaze Estimation via Modulation-Based Adaptive Network With Auxiliary Self-Learning
Yong Wu ... Yang Wang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
Yong Wu, et. al.Yong Wu ... Yang Wang
01 Aug 2022
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32

Appearance-based gaze estimation under slight head motion
Zhizhi Guo ... Qianxiang Zhou
Multimedia Tools and Applications | VOL. 76
Zhizhi Guo, et. al.Zhizhi Guo ... Qianxiang Zhou
09 Jan 2016
Multimedia Tools and Applications | VOL. 76

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A new appearance-based gaze estimation via multi-modal fusion

Abstract

Talk to us

Similar Papers