Accurate Head Pose Estimation Research Articles

Under the severe situation of the COVID-19 pandemic, masks cover most of the effective facial features of users, and their head pose changes significantly in a complex environment, which makes the accuracy of head pose estimation in some systems such as safe driving systems and attention detection systems impossible to guarantee. To this end, we propose a powerful four-branch feature selective extraction network (FSEN) structure, in which three branches are used to extract three independent discriminative features of pose angles, and one branch is used to extract composite features corresponding to multiple pose angles. By reducing the dimension of high-dimensional features, our method significantly reduces the amount of computation while improving the estimation accuracy. Our convolution method is an improved spatial channel dynamic convolution (SCDC) that initially enhances the extracted features. Additionally, we embed a regional information exchange network (RIEN) after each convolutional layer in each branch to fully mine the potential semantic correlation between regions from multiple perspectives and learn and fuse this correlation to further enhance feature expression. Finally, we fuse the independent discriminative features of each pose angle and composite features from the three directions of channel, space, and pixel to obtain perfect feature expression for each pose angle, and then obtain the head pose angle. We conducted extensive experiments on the controlled environment datasets and a self-built real complex environment dataset (RCE) and the results showed that our method outperforms state-of-the-art single-modality methods and performs on par with multimodality-based methods. This shows that our network meets the requirements of accurate head-pose estimation in real complex environments such as complex illumination and partial occlusion.

Read full abstract

Accurate head pose estimation from 2D image data is an essential component of applications such as driver monitoring systems, virtual reality technology, and human-computer interaction. It enables a better determination of user engagement and attentiveness. The most accurate head pose estimators are based on Deep Neural Networks that are trained with the supervised approach and rely primarily on the accuracy of training data. The acquisition of real head pose data with a wide variation of yaw, pitch and roll is a challenging task. Publicly available head pose datasets have limitations with respect to size, resolution, annotation accuracy and diversity. In this work, a methodology is proposed to generate pixel-perfect synthetic 2D headshot images rendered from high-quality 3D synthetic facial models with accurate head pose annotations. A diverse range of variations in age, race, and gender are also provided. The resulting dataset includes more than 300k pairs of RGB images with corresponding head pose annotations. A wide range of variations in pose, illumination and background are included. The dataset is evaluated by training a state-of-the-art head pose estimation model and testing against the popular evaluation-dataset Biwi. The results show that training with purely synthetic data generated using the proposed methodology achieves close to state-of-the-art results on head pose estimation which are originally trained on real human facial datasets. As there is a domain gap between the synthetic images and real-world images in the feature space, initial experimental results fall short of the current state-of-the-art. To reduce the domain gap, a semi-supervised visual domain adaptation approach is proposed, which simultaneously trains with the labelled synthetic data and the unlabeled real data. When domain adaptation is applied, a significant improvement in model performance is achieved. Additionally, by applying a data fusion-based transfer learning approach, better results are achieved than previously published work on this topic.

Read full abstract

Accurate Head Pose Estimation Research Articles

Related Topics

Articles published on Accurate Head Pose Estimation

Self-Attention Mechanism-Based Head Pose Estimation Network with Fusion of Point Cloud and Image Features.

Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network

Head Pose Estimation in Complex Environment Based on Four-Branch Feature Selective Extraction and Regional Information Exchange Fusion Network

Head pose estimation using deep neural networks and 3D point clouds

Learning 3D Head Pose From Synthetic Data: A Semi-Supervised Approach

Real-Time Head Pose Estimation and Face Modeling From a Depth Image

Head Pose Estimation in the Wild Assisted by Facial Landmarks Based on Convolutional Neural Networks

Learning toward practical head pose estimation

Model-free non-rigid head pose tracking by joint shape and pose estimation

Training‐based head pose estimation under monocular vision

Head Detection by the Adaboost Algorithm for Head Pose Estimation

An effective head pose estimation approach using Lie Algebrized Gaussians based face representation

A novel feature descriptor based on biologically inspired feature for head pose estimation

Iterative three-dimensional head pose estimation using a face normal vector

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Accurate Head Pose Estimation Research Articles

Related Topics

Articles published on Accurate Head Pose Estimation

Self-Attention Mechanism-Based Head Pose Estimation Network with Fusion of Point Cloud and Image Features.

Accurate Head Pose Estimation Using Image Rectification and a Lightweight Convolutional Neural Network

Head Pose Estimation in Complex Environment Based on Four-Branch Feature Selective Extraction and Regional Information Exchange Fusion Network

Head pose estimation using deep neural networks and 3D point clouds

Learning 3D Head Pose From Synthetic Data: A Semi-Supervised Approach

Real-Time Head Pose Estimation and Face Modeling From a Depth Image

Head Pose Estimation in the Wild Assisted by Facial Landmarks Based on Convolutional Neural Networks

Learning toward practical head pose estimation

Model-free non-rigid head pose tracking by joint shape and pose estimation

Training‐based head pose estimation under monocular vision

Head Detection by the Adaboost Algorithm for Head Pose Estimation

An effective head pose estimation approach using Lie Algebrized Gaussians based face representation

A novel feature descriptor based on biologically inspired feature for head pose estimation

Iterative three-dimensional head pose estimation using a face normal vector