In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations

Ikhsanul Habibie, Dushyant Mehta, Christian Theobalt, Weipeng Xu, Gerard Pons-Moll

doi:10.1109/cvpr.2019.01116

Abstract

Convolutional neural network (CNN) based approaches for monocular 3D human pose estimation usually require a large number of training images with 3D pose annotations. While it is feasible to provide 2D joint annotations for large corpora of in-the-wild images with humans, providing accurate 3D annotations for such in-the-wild corpora is hardly feasible in practice. Most existing 3D-labelled data sets are either synthetically created or feature in-studio images. 3D pose estimation algorithms trained on such data often have limited ability to generalize to real-world scene diversity. We therefore propose a new deep-learning-based method for monocular 3D human pose estimation that shows high accuracy and generalizes better to in-the-wild scenes. Its network architecture comprises a new disentangled hidden-space encoding of explicit 2D and 3D features, and it uses supervision from a new learned projection model that operates on the predicted 3D pose. Our algorithm can be jointly trained on image data with 3D labels and image data with only 2D labels. It achieves state-of-the-art accuracy on challenging in-the-wild data.
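
The joint training on 3D-labelled and 2D-only images described in the abstract can be illustrated with a minimal sketch of a mixed-supervision loss. This is not the authors' implementation: the function names (`project`, `mean_sq_error`, `mixed_loss`), the 2x3 linear projection, and the weighting factor `lambda_2d` are all assumptions made for illustration. In the paper the projection model is learned, whereas here its weights are simply passed in as fixed values.

```python
from typing import Optional, Sequence

Pose = Sequence[Sequence[float]]  # one row of coordinates per joint


def project(pose_3d: Pose, weights: Sequence[Sequence[float]]) -> list:
    """Hypothetical stand-in for a learned projection model.

    Applies a 2x3 linear map to each 3D joint to obtain a 2D joint.
    In a real system these weights would be trainable parameters.
    """
    return [
        [sum(w * c for w, c in zip(row, joint)) for row in weights]
        for joint in pose_3d
    ]


def mean_sq_error(a: Pose, b: Pose) -> float:
    """Mean squared error over all joint coordinates."""
    diffs = [(x - y) ** 2 for ja, jb in zip(a, b) for x, y in zip(ja, jb)]
    return sum(diffs) / len(diffs)


def mixed_loss(
    pred_3d: Pose,
    gt_2d: Pose,
    gt_3d: Optional[Pose] = None,
    weights: Sequence[Sequence[float]] = ((1.0, 0.0, 0.0), (0.0, 1.0, 0.0)),
    lambda_2d: float = 1.0,
) -> float:
    """Loss for one sample that may or may not carry a 3D label.

    The 2D term compares the projected 3D prediction with the 2D
    annotation and is always available. The 3D term is added only
    when a 3D annotation exists for the sample.
    """
    loss = lambda_2d * mean_sq_error(project(pred_3d, weights), gt_2d)
    if gt_3d is not None:
        loss += mean_sq_error(pred_3d, gt_3d)
    return loss
```

With this structure, an in-the-wild image that has only 2D joint annotations still contributes a training signal through the projection term, while a studio image with 3D labels contributes both terms.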

Similar Papers

Multi-View Pose Generator Based on Deep Learning for Monocular 3D Human Pose Estimation
Jun Sun ... Dejun Zhang
Symmetry | VOL. 12
04 Jul 2020

Modeling vs. learning approaches for monocular 3D human pose estimation
Wenjuan Gong ... Michael Arens
01 Nov 2011

Dual Networks Based 3D Multi-Person Pose Estimation From Monocular Video
Yu Cheng ... Robby T Tan
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
01 Feb 2023

A self-supervised spatio-temporal attention network for video-based 3D infant pose estimation
Wang Yin ... Ming Yi
Medical Image Analysis | VOL. 96
18 May 2024
