Head Pose Changes Research Articles

We propose original semantic labels for detailed face parsing to improve the accuracy of face recognition by focusing on parts in a face. The part labels used in conventional face parsing are defined based on biological features, and thus, one label is given to a large region, such as skin. Our semantic labels are defined by separating parts with large areas based on the structure of the face and considering the left and right sides for all parts to consider head pose changes, occlusion, and other factors. By utilizing the capability of assigning detailed part labels to face images, we propose a novel data augmentation method based on detailed face parsing called Face Semantic Erasing (FSErasing) to improve the performance of face recognition. FSErasing is to randomly mask a part of the face image based on the detailed part labels, and therefore, we can apply erasing‐type data augmentation to the face image that considers the characteristics of the face. Through experiments using public face image datasets, we demonstrate that FSErasing is effective for improving the performance of face recognition and face attribute estimation. In face recognition, adding FSErasing in training ResNet‐34 with Softmax using CelebA improves the average accuracy by 0.354 points and the average equal error rate (EER) by 0.312 points, and with ArcFace, the average accuracy and EER improve by 0.752 and 0.802 points, respectively. ResNet‐50 with Softmax using CASIA‐WebFace improves the average accuracy by 0.442 points and the average EER by 0.452 points, and with ArcFace, the average accuracy and EER improve by 0.228 points and 0.500 points, respectively. In face attribute estimation, adding FSErasing as a data augmentation method in training with CelebA improves the estimation accuracy by 0.54 points. We also apply our detailed face parsing model to visualize face recognition models and demonstrate its higher explainability than general visualization methods.

Read full abstract

IntroductionIn-scanner head motion is a common cause of reduced image quality in neuroimaging, and causes systematic brain-wide changes in cortical thickness and volumetric estimates derived from structural MRI scans. There are few widely available methods for measuring head motion during structural MRI. Here, we train a deep learning predictive model to estimate changes in head pose using video obtained from an in-scanner eye tracker during an EPI-BOLD acquisition with participants undertaking deliberate in-scanner head movements. The predictive model was used to estimate head pose changes during structural MRI scans, and correlated with cortical thickness and subcortical volume estimates. Methods21 healthy controls (age 32 ± 13 years, 11 female) were studied. Participants carried out a series of stereotyped prompted in-scanner head motions during acquisition of an EPI-BOLD sequence with simultaneous recording of eye tracker video. Motion-affected and motion-free whole brain T1-weighted MRI were also obtained. Image coregistration was used to estimate changes in head pose over the duration of the EPI-BOLD scan, and used to train a predictive model to estimate head pose changes from the video data. Model performance was quantified by assessing the coefficient of determination (R2). We evaluated the utility of our technique by assessing the relationship between video-based head pose changes during structural MRI and (i) vertex-wise cortical thickness and (ii) subcortical volume estimates. ResultsVideo-based head pose estimates were significantly correlated with ground truth head pose changes estimated from EPI-BOLD imaging in a hold-out dataset. We observed a general brain-wide overall reduction in cortical thickness with increased head motion, with some isolated regions showing increased cortical thickness estimates with increased motion. Subcortical volumes were generally reduced in motion affected scans. ConclusionsWe trained a predictive model to estimate changes in head pose during structural MRI scans using in-scanner eye tracker video. The method is independent of individual image acquisition parameters and does not require markers to be to be fixed to the patient, suggesting it may be well suited to clinical imaging and research environments. Head pose changes estimated using our approach can be used as covariates for morphometric image analyses to improve the neurobiological validity of structural imaging studies of brain development and disease.

Read full abstract

Head Pose Changes Research Articles

Related Topics

Articles published on Head Pose Changes

Enhanced Hybrid Vision Transformer with Multi-Scale Feature Integration and Patch Dropping for Facial Expression Recognition.

An effective cross-scenario remote heart rate estimation network based on global-local information and video transformer.

FSErasing: Improving Face Recognition with Data Augmentation Using Face Parsing

Point CNN:3D Face Recognition with Local Feature Descriptor and Feature Enhancement Mechanism.

Facial Expression Recognition Based on Fine-Tuned Channel-Spatial Attention Transformer.

Siamese PointNet: 3D Head Pose Estimation with Local Feature Descriptor

Tracking of rigid head motion during MRI using anEEG system.

Head Pose Estimation in Complex Environment Based on Four-Branch Feature Selective Extraction and Regional Information Exchange Fusion Network

Estimation of in-scanner head pose changes during structural MRI using a convolutional neural network trained on eye tracker video

A novel eye center localization method for multiview faces

A Novel Eye Center Localization Method for Head Poses With Large Rotations.

Hybrid Force Tracking Impedance Control-Based Autonomous Robotic System for Tooth Brushing Assistance of Disabled People

Thermal Face Recognition under Spatial Variation Conditions

On visual BMI analysis from facial images

Eye center localization in a facial image based on geometric shapes of iris and eyelid under natural variability

Estimating Audience Engagement to Predict Movie Ratings

Effect of head motion on MRI B0 field distribution.

RGB-D face recognition under various conditions via 3D constrained local model

Real-time 3D eyelids tracking from semantic edges

Conceiving Human Interaction by Visualising Depth Data of Head Pose Changes and Emotion Recognition via Facial Expressions

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Head Pose Changes Research Articles

Related Topics

Articles published on Head Pose Changes

Enhanced Hybrid Vision Transformer with Multi-Scale Feature Integration and Patch Dropping for Facial Expression Recognition.

An effective cross-scenario remote heart rate estimation network based on global-local information and video transformer.

FSErasing: Improving Face Recognition with Data Augmentation Using Face Parsing

Point CNN:3D Face Recognition with Local Feature Descriptor and Feature Enhancement Mechanism.

Facial Expression Recognition Based on Fine-Tuned Channel-Spatial Attention Transformer.

Siamese PointNet: 3D Head Pose Estimation with Local Feature Descriptor

Tracking of rigid head motion during MRI using anEEG system.

Head Pose Estimation in Complex Environment Based on Four-Branch Feature Selective Extraction and Regional Information Exchange Fusion Network

Estimation of in-scanner head pose changes during structural MRI using a convolutional neural network trained on eye tracker video

A novel eye center localization method for multiview faces

A Novel Eye Center Localization Method for Head Poses With Large Rotations.

Hybrid Force Tracking Impedance Control-Based Autonomous Robotic System for Tooth Brushing Assistance of Disabled People

Thermal Face Recognition under Spatial Variation Conditions

On visual BMI analysis from facial images

Eye center localization in a facial image based on geometric shapes of iris and eyelid under natural variability

Estimating Audience Engagement to Predict Movie Ratings

Effect of head motion on MRI B0 field distribution.

RGB-D face recognition under various conditions via 3D constrained local model

Real-time 3D eyelids tracking from semantic edges

Conceiving Human Interaction by Visualising Depth Data of Head Pose Changes and Emotion Recognition via Facial Expressions