Abstract
In real-world applications, factors such as illumination variation, occlusion, and poor image quality make head detection and pose estimation considerably more challenging. In this paper, we propose a multi-level structured hybrid forest (MSHF) for joint head detection and pose estimation. Our method extends the hybrid framework of classification and regression forests by introducing multi-level splitting functions and multi-structured features. The multi-level splitting functions are used to construct the trees in the different layers of the MSHF. The multi-structured features are extracted from randomly selected image patches, each of which belongs either to the head region or to the background. MSHF regression predicts the signed distance from each patch center to the head contour, and the head contour is derived from these predictions. Sub-regions randomly selected from the patches inside the head contour are then used to build the MSHF for head pose estimation in a coarse-to-fine manner. A weighted neighbor structured aggregation integrates the votes from the trees to estimate continuous pose angles. Experiments were conducted on public datasets and video streams. Compared with state-of-the-art methods, MSHF achieved improved performance and robustness, with an average accuracy of 90% and an average angular error of 6.6°. Joint head detection and pose estimation takes about 0.44 s on average.
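To illustrate the idea of combining per-tree votes into one continuous angle, the following is a minimal sketch only. It is not the weighted neighbor structured aggregation itself, whose details the abstract does not give. It assumes each tree casts one angle vote in degrees together with a variance, and that votes are weighted by inverse variance; the function name `aggregate_votes` and the parameter `eps` are hypothetical.

```python
from typing import Sequence


def aggregate_votes(
    angles: Sequence[float],
    variances: Sequence[float],
    eps: float = 1e-6,
) -> float:
    """Combine per-tree angle votes (degrees) into one continuous estimate.

    Assumption (not from the paper): each vote is weighted by the inverse
    of its variance, so confident trees contribute more to the result.
    A plain weighted mean is used, which is only appropriate when the
    angles do not wrap around (e.g. yaw limited to [-90, 90] degrees).
    """
    if len(angles) != len(variances) or len(angles) == 0:
        raise ValueError("angles and variances must be non-empty and of equal length")

    # eps avoids division by zero for votes with zero variance.
    weights = [1.0 / (v + eps) for v in variances]
    total = sum(weights)
    return sum(w * a for w, a in zip(weights, angles)) / total
```

For example, two equally confident votes of 10° and 20° average to 15°, while a vote with very large variance has almost no influence on the estimate.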