LOTR: Face Landmark Localization Using Localization Transformer

Ukrit Watchareeruetai,Sanjana Jain,Benjaphan Sommana,Ankush Ganguly,Nakarin Sritrakool,Aubin Samacoits,Samuel W F Earp,Pavit Noinongyao

doi:10.1109/access.2022.3149380

Ukrit Watchareeruetai, Sanjana Jain + Show 6 more

Open Access

https://doi.org/10.1109/access.2022.3149380

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 12	License type: CC BY-NC-ND 4.0

Affiliation: Chulalongkorn University

Abstract

This paper presents a novel Transformer-based facial landmark localization network named Localization Transformer (LOTR). The proposed framework is a direct coordinate regression approach leveraging a Transformer network to better utilize the spatial information in a feature map. An LOTR model consists of three main modules: 1) a visual backbone that converts an input image into a feature map, 2) a Transformer module that improves the feature representation from the visual backbone, and 3) a landmark prediction head that directly predicts landmark coordinates from the Transformer’s representation. Given cropped-and-aligned face images, the proposed LOTR can be trained end-to-end without requiring any post-processing steps. This paper also introduces a loss function named smooth-Wing loss, which addresses the gradient discontinuity of the Wing loss, leading to better convergence than standard loss functions such as L1, L2, and Wing loss. Experimental results on the JD landmark dataset provided by the First Grand Challenge of 106-Point Facial Landmark Localization indicate the superiority of LOTR over the existing methods on the leaderboard and two recent heatmap-based approaches. On the WFLW dataset, the proposed LOTR framework demonstrates promising results compared with several state-of-the-art methods. Additionally, we report an improvement in the performance of state-of-the-art face recognition systems when using our proposed LOTRs for face alignment.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LOTR: Face Landmark Localization Using Localization Transformer

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Multi-modal ear and face modeling and recognition
Mohammad H Mahoor ... Mohamed Abdel-Mottaleb
-
Mohammad H Mahoor, et. al.Mohammad H Mahoor ... Mohamed Abdel-Mottaleb
01 Nov 2009
01 Nov 2009

Rectified Wing Loss for Efficient and Robust Facial Landmark Localisation with Convolutional Neural Networks
Zhen-Hua Feng ... Josef Kittler
International Journal of Computer Vision | VOL. 128
Zhen-Hua Feng, et. al.Zhen-Hua Feng ... Josef Kittler
17 Dec 2019
International Journal of Computer Vision | VOL. 128

Landmark localization approach for facial computing
Raphael Angulu ... Aderemi O Adewumi
-
Raphael Angulu, et. al.Raphael Angulu ... Aderemi O Adewumi
01 Mar 2017
01 Mar 2017

Multi-modal biometric modeling and recognition of the human face and ear
Steven Cadavid ... Mohamed Abdel-Mottaleb
-
Steven Cadavid, et. al.Steven Cadavid ... Mohamed Abdel-Mottaleb
01 Nov 2009
01 Nov 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LOTR: Face Landmark Localization Using Localization Transformer

Abstract

Talk to us

Similar Papers

More From: IEEE Access