RoI Tanh-polar transformer network for face parsing in the wild

Yiming Lin, Jie Shen, Yujiang Wang, Maja Pantic

doi:10.1016/j.imavis.2021.104190

Abstract

Face parsing aims to predict pixel-wise labels for the facial components of a target face in an image. Existing approaches usually crop the target face from the input image using a bounding box computed during pre-processing, and can therefore only parse inner facial Regions of Interest (RoIs). Peripheral regions such as hair are ignored, and nearby faces that are partially included in the bounding box can cause distractions. Moreover, these methods are trained and evaluated only on near-frontal portrait images, so their performance on in-the-wild cases has been unexplored. To address these issues, this paper makes three contributions. First, we introduce the iBugMask dataset for face parsing in the wild, which consists of 21,866 training images and 1,000 testing images. The training images are obtained by augmenting an existing dataset with large face poses. The testing images are manually annotated with 11 facial regions and show large variations in size, pose, expression and background. Second, we propose the RoI Tanh-polar transform, which warps the whole image to a Tanh-polar representation with a fixed ratio between the face area and the context, guided by the target bounding box. The new representation contains all the information in the original image and allows for rotation equivariance in convolutional neural networks (CNNs). Third, we propose a hybrid residual representation learning block, coined HybridBlock, that contains convolutional layers in both the Tanh-polar space and the Tanh-Cartesian space, allowing for receptive fields of different shapes in CNNs. Through extensive experiments, we show that the proposed method improves on the state of the art in face parsing in the wild and does not require facial landmarks for alignment.
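
The abstract does not give the formula for the RoI Tanh-polar transform, so the following is only a minimal sketch of one plausible coordinate mapping, not the authors' implementation. It assumes that the radius is normalised by the ellipse inscribed in the bounding box and then squashed with tanh, so the whole image plane fits into a bounded radial range. The function names tanh_polar_coords and inverse_tanh_polar, and the (x0, y0, x1, y1) box layout, are hypothetical.

```python
import numpy as np


def tanh_polar_coords(x, y, box):
    """Map pixel (x, y) to (rho, theta); rho lies in [0, 1] and saturates far away.

    Assumption: `box` is (x0, y0, x1, y1) and the face is approximated by the
    ellipse inscribed in that box. This is an illustrative choice only.
    """
    x0, y0, x1, y1 = box
    cx, cy = (x0 + x1) / 2.0, (y0 + y1) / 2.0   # box centre
    a, b = (x1 - x0) / 2.0, (y1 - y0) / 2.0     # ellipse semi-axes
    u = (np.asarray(x, dtype=float) - cx) / a   # normalised offsets:
    v = (np.asarray(y, dtype=float) - cy) / b   # the ellipse becomes a unit circle
    r = np.hypot(u, v)                          # r == 1 on the ellipse boundary
    theta = np.arctan2(v, u)                    # angle around the box centre
    rho = np.tanh(r)                            # compress [0, inf) into [0, 1)
    return rho, theta


def inverse_tanh_polar(rho, theta, box):
    """Map (rho, theta) back to pixel coordinates (valid for rho < 1)."""
    x0, y0, x1, y1 = box
    cx, cy = (x0 + x1) / 2.0, (y0 + y1) / 2.0
    a, b = (x1 - x0) / 2.0, (y1 - y0) / 2.0
    r = np.arctanh(rho)                         # undo the tanh squashing
    return cx + a * r * np.cos(theta), cy + b * r * np.sin(theta)


if __name__ == "__main__":
    box = (10, 20, 50, 100)                     # hypothetical face bounding box
    rho, theta = tanh_polar_coords(50, 60, box) # midpoint of the right box edge
    print(rho, theta)                           # rho equals tanh(1), theta equals 0
```

Under these assumptions, any point on the inscribed ellipse maps to rho = tanh(1) whatever the size of the box, which is one way to obtain a fixed ratio between face and context. A rotation about the box centre in the normalised coordinates changes only theta, so it becomes a shift along one axis of the warped image.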

Journal: Image and Vision Computing
Publication Date: May 6, 2021
Citations: 25
