Augmented EMTCNN: A Fast and Accurate Facial Landmark Detection Network

Hyeon-Woo Kim,Seungmin Rho,Eenjun Hwang,Hyung-Joon Kim

doi:10.3390/app10072253

Hyeon-Woo Kim, Seungmin Rho + Show 2 more

Open Access

https://doi.org/10.3390/app10072253

Copy DOI

Journal: Applied Sciences	Publication Date: Mar 26, 2020
Citations: 23	License type: CC BY 4.0

Affiliation: Korea University, Sejong University

Abstract

Facial landmarks represent prominent feature points on the face that can be used as anchor points in many face-related tasks. So far, a lot of research has been done with the aim of achieving efficient extraction of landmarks from facial images. Employing a large number of feature points for landmark detection and tracking usually requires excessive processing time. On the contrary, relying on too few feature points cannot accurately represent diverse landmark properties, such as shape. To extract the 68 most popular facial landmark points efficiently, in our previous study, we proposed a model called EMTCNN that extended the multi-task cascaded convolutional neural network for real-time face landmark detection. To improve the detection accuracy, in this study, we augment the EMTCNN model by using two convolution techniques—dilated convolution and CoordConv. The former makes it possible to increase the filter size without a significant increase in computation time. The latter enables the spatial coordinate information of landmarks to be reflected in the model. We demonstrate that our model can improve the detection accuracy while maintaining the processing speed.

Highlights

Facial landmarks such as eyes, nose, and mouth are prominent feature points on the face, and diverse tasks such as face recognition, gaze detection, person tracking, emotion recognition, and virtual makeup have been performed based on facial landmarks [1,2]
Sci. 2020, 10, 2253 time, we proposed an EMTCNN model by extending the original multi-task cascaded convolutional neural network (MTCNN) model [12] which extracts five facial landmark points in real time
MTCNN is a cascaded structure composed of relatively light convolutional neural networks (CNNs) including a proposal network (P-Net), refinement network (R-Net), and output network (O-Net)

Summary

Introduction

Facial landmarks such as eyes, nose, and mouth are prominent feature points on the face, and diverse tasks such as face recognition, gaze detection, person tracking, emotion recognition, and virtual makeup have been performed based on facial landmarks [1,2]. In an effort to detect such facial landmark points accurately, adding more convolution layers has been attempted, as in Visual Geometry Group Network (VGGNet) [9,10] Even though this produces better results, it requires more computational resources and is not appropriate for real-time processing. Sci. 2020, 10, 2253 time, we proposed an EMTCNN model by extending the original multi-task cascaded convolutional neural network (MTCNN) model [12] which extracts five facial landmark points in real time. CoordConv [17]—to improve the detection accuracy while maintaining the processing speed The former makes it possible to extend the receptive field without increasing the number of parameters.

Related Works

Materials and Methods

EMTCNN Augmentation

CoordConv Layer

CoordConv

Dataset

Experiment

Training

Accuracy of Landmark Point Extraction

Method

Effects of Weights on Accuracy

Findings

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Augmented EMTCNN: A Fast and Accurate Facial Landmark Detection Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Acquiring 3D scene information from 2D images

-

01 May 2004
01 May 2004

The face of art
Jordan Yaniv ... Ariel Shamir
ACM Transactions on Graphics | VOL. 38
Jordan Yaniv, et. al.Jordan Yaniv ... Ariel Shamir
12 Jul 2019
ACM Transactions on Graphics | VOL. 38

Detecting Facial Region and Landmarks at Once via Deep Network
Taehyung Kim ... Euichul Lee
Sensors | VOL. 21
Taehyung Kim, et. al.Taehyung Kim ... Euichul Lee
09 Aug 2021
Sensors | VOL. 21

Fast Anchor Point Matching for Emergency UAV Image Stitching Using Position and Pose Information.
Ruizhe Shao ... Jun Li
Sensors | VOL. 20
Ruizhe Shao, et. al.Ruizhe Shao ... Jun Li
03 Apr 2020
Sensors | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Augmented EMTCNN: A Fast and Accurate Facial Landmark Detection Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences