Abstract

We propose a multistream multitask deep network for joint human detection and head pose estimation in RGB-D videos. To achieve high accuracy, we jointly exploit appearance, shape, and motion information as inputs. Based on the depth information, we generate scale-invariant proposals, which are then fed into a novel contextual region of interest pooling (CRP) layer in our deep network. The CRP layer uses two branches to incorporate contextual information for each subject. The proposed method outperforms state-of-the-art approaches on three public datasets.
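The contextual pooling idea described above can be illustrated with a minimal sketch: one branch pools features from the proposal box itself, a second from an enlarged "context" box around it, and the two results are stacked. This is an assumption-laden toy in NumPy, not the authors' CRP layer; the function names, the max-pooling choice, and the context scale factor are all hypothetical.

```python
import numpy as np

def roi_max_pool(fmap, box, out=2):
    # fmap: (C, H, W) feature map; box: (x0, y0, x1, y1) in feature-map coords.
    # Divides the box into an out x out grid and max-pools each cell.
    x0, y0, x1, y1 = box
    region = fmap[:, y0:y1, x0:x1]
    C, h, w = region.shape
    ys = np.array_split(np.arange(h), out)
    xs = np.array_split(np.arange(w), out)
    pooled = np.zeros((C, out, out))
    for i, yi in enumerate(ys):
        for j, xj in enumerate(xs):
            pooled[:, i, j] = region[:, yi][:, :, xj].max(axis=(1, 2))
    return pooled

def contextual_roi_pool(fmap, box, scale=1.5, out=2):
    # Branch 1: the subject box itself. Branch 2: an enlarged context box
    # (scale factor is a hypothetical choice), clipped to the feature map.
    x0, y0, x1, y1 = box
    cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
    w, h = (x1 - x0) * scale, (y1 - y0) * scale
    H, W = fmap.shape[1:]
    ctx = (max(0, int(cx - w / 2)), max(0, int(cy - h / 2)),
           min(W, int(cx + w / 2)), min(H, int(cy + h / 2)))
    # Concatenate the two branches along the channel axis.
    return np.concatenate([roi_max_pool(fmap, box, out),
                           roi_max_pool(fmap, ctx, out)], axis=0)

fmap = np.arange(64, dtype=float).reshape(1, 8, 8)
p = contextual_roi_pool(fmap, (2, 2, 6, 6))
print(p.shape)  # (2, 2, 2): subject channels stacked with context channels
```

In a full detector the two pooled tensors would feed separate fully connected branches for the detection and head-pose tasks; here the sketch only shows how the same proposal yields both a tight and a context-enlarged feature.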
