RGBD Based Gaze Estimation via Multi-Task CNN

Dongze Lian,Weixin Luo,Jingyi Yu,Shenghua Gao,Lina Hu,Minye Wu,Zechao Li,Ziheng Zhang

doi:10.1609/aaai.v33i01.33012488

Abstract

This paper tackles RGBD based gaze estimation with Convolutional Neural Networks (CNNs). Specifically, we propose to decompose gaze point estimation into eyeball pose, head pose, and 3D eye position estimation. Compared with RGB image-based gaze tracking, having depth modality helps to facilitate head pose estimation and 3D eye position estimation. The captured depth image, however, usually contains noise and black holes which noticeably hamper gaze tracking. Thus we propose a CNN-based multi-task learning framework to simultaneously refine depth images and predict gaze points. We utilize a generator network for depth image generation with a Generative Neural Network (GAN), where the generator network is partially shared by both the gaze tracking network and GAN-based depth synthesizing. By optimizing the whole network simultaneously, depth image synthesis improves gaze point estimation and vice versa. Since the only existing RGBD dataset (EYEDIAP) is too small, we build a large-scale RGBD gaze tracking dataset for performance evaluation. As far as we know, it is the largest RGBD gaze dataset in terms of the number of participants. Comprehensive experiments demonstrate that our method outperforms existing methods by a large margin on both our dataset and the EYEDIAP dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RGBD Based Gaze Estimation via Multi-Task CNN

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jul 17, 2019
Citations: 26

Similar Papers

RGB-D-based gaze point estimation via multi-column CNNs and facial landmarks global optimization
Ziheng Zhang ... Dongze Lian
The Visual Computer | VOL. 37
Ziheng Zhang, et. al.Ziheng Zhang ... Dongze Lian
30 Oct 2020
The Visual Computer | VOL. 37

A probabilistic framework for joint head tracking and pose estimation
...
-
, et. al. ...
23 Aug 2004
23 Aug 2004

A probabilistic framework for joint head tracking and pose estimation
S.O Ba ... J.M Odobez
-
S.O Ba, et. al.S.O Ba ... J.M Odobez
01 Jan 2004
01 Jan 2004

POSEidon: Face-from-Depth for Driver Pose Estimation
Guido Borghi ... Roberto Vezzani
-
Guido Borghi, et. al.Guido Borghi ... Roberto Vezzani
01 Jul 2017
01 Jul 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RGBD Based Gaze Estimation via Multi-Task CNN

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence