Abstract
AbstractIn exemplar-based approaches for human pose estimation, it is common to extract multiple features to better describe the visual input data. However, simply concatenating multiview features into a long vector has two shortcomings: (1) it suffers from “curse of dimensionality”; (2) it is not physically meaningful and may be incapable of fully exploiting the complementary properties of multi-view features. To address such problems, in this paper we present a dimension reduction method based on supervised spectral embedding, followed by an ensemble of nearest neighbor regressions in multi-view feature space, to infer 3D human poses from monocular videos. The experiments on HumanEva dataset show the effectiveness of the proposed method.KeywordsHuman pose estimationSpectral embedding k-NN regression
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.