Hierarchical Binary CNNs for Landmark Localization with Limited Resources

Adrian Bulat,Georgios Tzimiropoulos

doi:10.1109/tpami.2018.2866051

Abstract

Our goal is to design architectures that retain the groundbreaking performance of Convolutional Neural Networks (CNNs) for landmark localization and at the same time are lightweight, compact and suitable for applications with limited computational resources. To this end, we make the following contributions: (a) we are the first to study the effect of neural network binarization on localization tasks, namely human pose estimation and face alignment. We exhaustively evaluate various design choices, identify performance bottlenecks, and more importantly propose multiple orthogonal ways to boost performance. (b) Based on our analysis, we propose a novel hierarchical, parallel and multi-scale residual architecture that yields large performance improvement over the standard bottleneck block while having the same number of parameters, thus bridging the gap between the original network and its binarized counterpart. (c) We perform a large number of ablation studies that shed light on the properties and the performance of the proposed block. (d) We present results for experiments on the most challenging datasets for human pose estimation and face alignment, reporting in many cases state-of-the-art performance. (e) We further provide additional results for the problem of facial part segmentation. Code can be downloaded from https://www.adrianbulat.com/binary-cnn-landmarks.

Highlights

THIS work is on localizing a predefined set of fiducial points on objects of interest which can typically undergo non-rigid deformations like the human body or face
Work based on Convolutional Neural Networks (CNNs) has revolutionized landmark localization, demonstrating results of remarkable accuracy even on the most challenging datasets for human pose estimation [1], [2], [3] and face alignment [4]
They behave as simple filters deciding when a certain value should be passed or not. This allows the input to pass through the layer with little modifications, sometimes blocking “good features” and hurting the overall performance by a noticeable amount. This is problematic for the task of landmark localization, where a high level of detail is required for successful localization

Summary

Introduction

THIS work is on localizing a predefined set of fiducial points on objects of interest which can typically undergo non-rigid deformations like the human body or face. Work based on Convolutional Neural Networks (CNNs) has revolutionized landmark localization, demonstrating results of remarkable accuracy even on the most challenging datasets for human pose estimation [1], [2], [3] and face alignment [4]. This work is on highly accurate and robust yet efficient and lightweight landmark localization using binarized CNNs. Our work is inspired by recent results of binarized CNN architectures on image classification [5], [6]. Our work is inspired by recent results of binarized CNN architectures on image classification [5], [6] Contrary to these works, we are the first to study the effect of neural network binarization on fine-grained tasks like landmark localization. We are interested only in the latter one which was designed to reduce the number of parameters and keep the

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Aug 23, 2018
Citations: 80	License type: CC BY 3.0

R Discovery Prime

R Discovery Prime

Hierarchical Binary CNNs for Landmark Localization with Limited Resources

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Similar Papers

Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources
Adrian Bulat ... Georgios Tzimiropoulos
-
Adrian Bulat, et. al.Adrian Bulat ... Georgios Tzimiropoulos
01 Oct 2017
01 Oct 2017

Multiple local 3D CNNs for region-based prediction in smart cities
Yibi Chen ... Cen Chen
Information Sciences | VOL. 542
Yibi Chen, et. al.Yibi Chen ... Cen Chen
25 Jun 2020
Information Sciences | VOL. 542

Fast and robust segmentation of the striatum using deep convolutional neural networks
Hongyoon Choi ... Kyong Hwan Jin
Journal of Neuroscience Methods | VOL. 274
Hongyoon Choi, et. al.Hongyoon Choi ... Kyong Hwan Jin
21 Oct 2016
Journal of Neuroscience Methods | VOL. 274

Pedestrian gender classification using combined global and local parts-based convolutional neural networks
Choon-Boon Ng ... Yong-Haur Tay
Pattern Analysis and Applications | VOL. 22
Choon-Boon Ng, et. al.Choon-Boon Ng ... Yong-Haur Tay
31 Jul 2018
Pattern Analysis and Applications | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hierarchical Binary CNNs for Landmark Localization with Limited Resources

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence